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1. Introduction 

There are many beautiful and important configurations of points on the surface 
of a sphere, ranging from familiar examples such as the vertices of an icosahedron 
to more subtle cases such as the minimal vectors of the Eg or Leech lattices. Yudin, 
Kolushov, and Andreev have shown in IKYll IKY2I lAnll IAn2| that certain con- 
figurations, including the ones mentioned above, minimize specific sorts of energy. 
In this paper, we prove that in fact they minimize a far broader class of energies 
than was previously known; we call this property universal optimality (the precise 
definition is given below). Our proofs apply to a number of other configurations. 
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enumerated in Tabled below, as well as to energy minimization in other compact 
two-point homogeneous spaces or Euclidean space. 

The setting of this paper is potential energy minimization. If one constrains 
some identically charged particles to move on the surface of a sphere, how will they 
arrange themselves? They will eventually spread out so as to locally minimize their 
electrical potential energy, assuming their kinetic energy dissipates according to 
a force such as viscosity. We will ignore questions of dynamics and simply focus 
on the relative positions of the particles once they reach a minimum for potential 
energy. Furthermore, we will focus on global minima, although particles can indeed 
become trapped in a local minimum. 

This question leads to a fundamental problem in extremal geometry, in which 
we abstract the key points by working in an arbitrary dimension and replacing 
the electrical potential with an arbitrary radial potential function. Let / : (0, 4] — > 
[0, oo) be any decreasing, continuous function. Given a finite subset C of the unit 
sphere 5*""^ C M", define the potential energy (more precisely, the f -potential 
energy) of C to be 

Because each pair x, y is counted in both orders, our potential energy is twice that 
from physics. We normalize it this way because it makes the formulas prettier. It 
is also important to note that we view potential functions as functions of squared 
distance. 

For a fixed number iV > 1, how can one choose C with \C\ = so as to minimize 
the potential energy? (To ensure that a minimum exists, when limr^o+ /('') is 
finite, we must allow degenerations in which points in the configuration coincide.) 
In most cases the answer is some complicated configuration, but it is remarkable 
how many important structures minimize potential energy. 

In M.^ the electrical potential corresponds to /(r) = l/r^/^, and a natural gen- 
eralization to K" with n > 3 is /(r) = That is an especially important 
power law, because x i-^ is a harmonic function on M" \ {0}. However, 
any power law is of interest, as are more general potentials. 

Many researchers have studied potential energy minima. One of the most beau- 
tiful rigorous approaches is due to Yudin in jY| . He studied the harmonic power law 
and used spherical harmonics to give sharp lower bounds for potential energy for 
n+1 or 2n points on S"~^ (the optima are the regular simplex and cross polytope). 
This work was extended by Kolushov and Yudin in |KYlj . For {n,N) = (8,240), 
they showed that the minimal vectors in the Eg, root lattice are the unique global 
minimum. Andreev applied their techniques in |An2l IAnl| to show that the Leech 
lattice minimal vectors are the unique minimum for (n, TV) = (24, 196560) and the 
vertices of the regular icosahedron are the unique minimum for (n, N) — (3,12). 
In the present paper, we extend this approach to much more general potential 
functions, as well as numerous other point configurations (listed in Table 

The paper |KY2| analyzed arbitrary power laws for the simplex, cross polytope, 
and Eg root system in a closely related maximization problem (maximizing poten- 
tial energy given an increasing potential function) and raised the question of how 
generally the techniques could be applied. We will deal with quite general potential 
functions. Without loss of generality one can assume that the potential function 
is nonnegative (only distances between and 2 occur on the sphere, and one can 
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add a constant to ensure nonnegativity over that range). The potential funetion 
must be decreasing, which corresponds to a repulsive force, and it is natural to 
require convexity as well. Our results require a strengthening of these conditions, 
namely complete monotonicity. Recall that a C°° function /:/ ^ R on an in- 
terval / is completely monotonic if (— 1)*^/'^'^^ (x) > for all x £ I and all fc > 
(see Wid, p. 145]) and strictly completely monotonic if strict inequality always 
holds in the interior of /. Of course, derivatives at endpoints of intervals denote 
one-sided derivatives. In this paper, endpoints are irrelevant, because we use half- 
open intervals closed on the right, and one can show using the mean value theorem 
that complete monotonicity on the interior implies that it holds also at the right 
endpoint. 

If a completely monotonic function fails to be strictly completely monotonic, 
then it must be a polynomial: if /'^'^■' (x) = 0, then complete monotonicity implies 
that Z^*^) (y) = for aU y > X, in which case /^'^■' is identically zero because it is 
analytic (Theorem 3a in |Widl p. 146]). 

All inverse power laws /(r) = l/r"^ with s > are strictly completely monotonic 
on (0, oo), and they are the most important cases for our purposes. Other examples 
include /(r) — e~'^'^ with c > 0. 

It might seem preferable to use completely monotonic functions of distance, 
rather than squared distance, but squared distance is in fact more natural than it 
first appears. It simplifies formulas appearing in later sections of this paper, it fits 
into the general framework described in Section|Hl and it is more general than using 
distance: it is not hard to prove that if r /(^^) is completely monotonic on a 
subinterval (a, b) of (0, oo), then / is completely monotonic on (a^, 6^), but not vice 
versa. 

We will examine certain special configurations of points on high-dimensional 
spheres, which were introduced by Levenshtein in |Lev2| (in which he proved that 
they are optimal spherical codes). We refer to them as the sharp arrangements, 
because everything fits into place so perfectly that many bounds turn out to be 
sharp. Each is a spherical code with certain parameters: an {n, N, t) spherical code 
is a set of N points on S*""^ such that no two of them have inner product greater 
than t (unless they are equal). In other words, all angles between them are at least 
cos~^ t. Each sharp configuration is also a spherical M-design for some M; recall 
that a spherical M-design is a finite subset of the sphere such that every polynomial 
on M" of total degree at most M has the same average over the design as over the 
entire sphere. 

Definition 1.1. A finite subset of the unit sphere S^~^ is a sharp configuration 
if there are m inner products between distinct points in it and it is a spherical 
{2m — l)-design. 

Replacing 2m — 1 with 2m -I- 1 in Definition 11.11 would be impossible, for the 
following reason. If ti, . . . ,tm are the inner products that occur between distinct 
points in a configuration and y is a point in the configuration, then the polynomial 

m 

x^{l-{x,y))l[{{x,y)~t.,f 

i=l 

of degree 2m -|- 1 vanishes on the entire configuration. However, its integral over the 
sphere does not vanish, because the polynomial is nonnegative on the sphere and 
not identically zero. Thus, sharp configurations are spherical designs of nearly the 
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Table 1. The known sharp configurations, together with the 600-ceU. 
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greatest possible strength given the number of distances occurring in them. (Some 
but not all sharp configurations are actually 2m-designs.) 

All sharp arrangements that we know of are listed in Table together with 
the 600-cell (which is not sharp, but to which our techniques nevertheless apply). 
The columns list the dimension n of the ambient Euclidean space, the number N of 
points, the largest M such that the code is a spherical M-design, the inner products 
other than 1 that occur between points in the code, and the name of the code, if 
any. If t denotes the largest inner product, then each of these codes is the unique 
(n, N, t) spherical code, except for some of those listed on the last line of the table 
(the isotropic subspace codes). See Appendix IXI for details about uniqueness. 

The first six configurations listed in Table ^ are the vertices of certain full- 
dimensional regular polytopes (specifically, those regular polytopes whose faces 
are simplices), together with lower-dimensional simplices. The next seven cases 
are derived from the Es root lattice in and the Leech lattice in M.'^^. The 
240-point and 196560-point configurations are the minimal (nonzero) vectors in 
those lattices. In sphere packing terms, they are the kissing configurations, the 
points of tangency in the corresponding sphere packings. Each arrangement with 
the label "kissing" is the kissing configuration of the arrangement above it: each 
configuration yields a sphere packing in spherical geometry by centering congruent 
spherical caps at the points, with radius as large as possible without making their 
interiors overlap. The points of tangency on a given cap form a spherical code 
in a space of one fewer dimension. In general different spheres in a packing can 
have different kissing configurations, but that does not occur here because each 
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configuration's automorphism group acts transitively on it. (See Chapter 14 of 
|CS| for the details of these configurations.) Some of the kissing configurations 
are of independent interest. For example, the 27 points on form the Schlafli 
arrangement, which is connected as follows with the classical configuration of 27 
lines on a cubic surface. Let X be a smooth cubic surface in CP'^. Its divisor class 
group Pic(X) is a free abelian group of rank 7, equipped with a bilinear form of 
signature (1,6) given by taking intersection numbers (see Proposition 4.8 in [Hj 
p. 401]). The class h of a. hyperplane section has self-intersection number = 3, 
so its orthogonal complement h-^ is negative definite. Thus, after changing the sign 
of the metric, ®z K is naturally a Euclidean space. Orthogonally projecting 
the divisor classes of the 27 lines on X from Pic(X) ®z ^ to ®i E yields 27 
points on a sphere, which form the Schlafli configuration (up to scaling). The sharp 
configurations with 240, 56, or 16 points have similar interpretations via exceptional 
curves on del Pezzo surfaces; see Corollary 25.1.1 and Theorem 26.2 in |Man| for 
details. 

The sharp configuration of 552 points in M'^'^ conies from an equiangular arrange- 
ment of 276 lines in R^'^ described by the unique regular two-graph on 276 vertices; 
see Chapter 11 of |GR,j . It can be derived from the Leech lattice A24 as follows. 
Choose any w G A24 with |wp = 6. Then the 552 points are the vectors v G A24 
satisfying |wp — \w — — 4. These points all lie on a hyperplane, but it does 
not pass through the origin, so subtract w/2 from each vector to obtain points on 
a sphere centered at the origin. 

The Higman-Sims configuration of 100 points in M^^ can be constructed simi- 
larly, following (WUj . Choose wi and W2 in A24 satisfying = |'fi'2p — 6 and 
\wi — t«2p = 4. Then there are 100 points v € A24 satisfying jwp = \wi — up = 
|w2 — up = 4, and they form the Higman-Sims configuration (on an affine subspace 
as above). 

One might imitate the last two constructions by using the Eg lattice instead of 
the Leech lattice and replacing the norms 6 and 4 with the smallest two norms in 
Eg, namely 4 and 2. That works, but it merely yields the cross polytope in and 
the simplex in M^. 

The last line of the table describes a remarkable family of sharp configurations 
from jCGS| . They are the only known sharp configurations not derived from regular 
polytopes, the Eg, lattice, or the Leech lattice. The parameter q must be a prime 
power. When q — 2, this arrangement is the Schlafli configuration (and it is a 
spherical 4-design), but for g > 2 it is different from all the other entries in the table. 
Points in the configuration correspond to totally isotropic 2-dimensional subspaces 
of a 4-dimensional Hermitian space over ¥q2 , with the distances between points 
determined by the dimensions of the intersections of the corresponding subspaces 
(inner product —l/q corresponds to intersection dimension 1). This construction 
generalizes to higher-dimensional Hermitian spaces, but the 891-point configuration 
in appears to be the only other sharp configuration that can be obtained in this 
way, from totally isotropic 3-dimensional subspaces of a 6-dimensional Hermitian 
space over F4. 

Antipodal sharp configurations are the same as antipodal tight spherical designs, 
by Theorem 6.8 in DCS . Much progress has been made towards classifying such 
designs; see jBMVj for more details. In particular. Table ^ contains all of the 
antipodal sharp configurations in at most 103 dimensions. 
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Tabic n coincides with the hst of known cases in which the hnear programming 
bounds for spherical codes are sharp; the list comes from |Lev8l p. 621], except 
for the 600-cell, which was dealt with in jAn3| and [El]. We conjecture that our 
techniques apply to a configuration if and only if the linear programming bounds 
for spherical codes prove a sharp bound for it. 

In our main theorem, we prove that sharp arrangements and also the 600-cell are 
global potential energy minima for every completely monotonic potential function: 

Theorem 1.2. Let /: (0,4] —> M be completely monotonic, and let C C 5"^^ be a 
sharp arrangement or the vertices of a regular 600-cell. If C C S"~^ is any subset 
satisfying \C'\ = \C\, then 

(1.1) E fi\--yn> E /(l^-yH- 

If f is strictly completely monotonic, then equality in Hl.l|l implies that C is also 
a sharp configuration or the vertices of a QOO-cell (whichever C is) and the same 
distances occur in C and C . In that case, if C is listed in Tabled but not on the 
last line, then C = AC for some A G 0(n) (i.e., C and C are isometric) . 

Uniqueness does not necessarily hold if the hypothesis that / must be strictly 
completely monotonic is removed (consider a constant potential function) . Further- 
more, the configurations from the last line in Table ^a-re not always unique. The 
construction in |CGS| builds such a configuration given any generalized quadran- 
gle with parameters (q,q^). For g < 3 such quadrangles are unique (see Section 9 
in jCGS| ). as are the spherical codes derived from them, but for larger q that is 
not always true. Whenever distinct generalized quadrangles exist, the construction 
yields distinct spherical codes. In particular, as explained in CGS^, one can con- 
struct such quadrangles from ovoids in F^P"^. When q — 2'^ with k odd and k > 1, 
there are at least two distinct ovoids, namely the elliptic quadric ovoid and the 
Suzuki-Tits ovoid (see [Tj). They yield distinct generalized quadrangles and thus 
distinct universally optimal spherical codes. 

To illustrate the techniques used to prove Theorem 11.21 in this section we will 
prove the simplest possible case, namely when \C\ < n + 1 (in which case C is a 
simplex) . The full proof is in Section |B| 

Proof of special case. Let N — \C\, and suppose N < n + I. Then C is a regular 
simplex with inner product —1/{N— 1) between distinct points, so the squared 
Euclidean distance between distinct points is 2 -I- 2/ (TV — 1) (because — = 
2 — 2{x,y) when = |yp — 1). For this special case, we only require / to be 
decreasing and convex, rather than completely monotonic. 
Let 

h{x) = f{2 + 2/{N - 1)) + /'(2 + 2/{N - \)){x - (2 + 2/{N - 1))), 

so h is the tangent line to / at 2 -f 2/{N — 1). Because / is convex, f{x) > h(x) 
for all X £ (0,4], and if / is strictly convex, then equality holds if and only if 
x = 2 + 2/{N -1). 

Suppose C is any subset of S"-'^ with |C'| = N. Then 

E f{\-~yn> E H\--yn^ 

x,y£C' ,x^y x,y&C' ,x^y 
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and the right side equals 

7V(7V-l)/(2 + 2/(7V-l)) 
- (2 + 2I{N - l))N{N - l)/'(2 + 2/{N - 1)) 

+ /'(2 + 2/(7v-i)) J2 \^-y\ 
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J, - y I . 

x,yeC' 
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J2 \x-y\^ ^ ^ {2~2{x,y)) 

x,y£C' x,y£C' 



2N'^ 



2 



< 2N'^. 

Because f'(2 + 2/{N — 1)) < 0, we find that the potential energy of C is at least 

N{N-l)f{2 + 2/{N~l)) 
- (2 + 2/{N - 1))N{N - l)/'(2 + 2/{N - 1)) 
+ 27VV(2 + 2/(7V- 1)), 

which equals 

7V(7V-l)/(2 + 2/(7V-l)). 
That is the potential energy of C, so we have proved that 

x,y(^C',x^y x.y^C.x^y 

When / is strictly convex, equality holds only if |x — j/p = 2 + 2/{N — 1) for all 
x,y € C with x ^ y, in which case C is a regular simplex. □ 

We use the term universal optimality to describe the conclusion of Theorem 1 1.21 

Definition 1.3. A finite subset C C 5""^ is universally optimal if it (weakly) 
minimizes potential energy among all configurations of \C\ points on S"^^ for each 
completely monotonic potential function. 

As the name suggests, every universally optimal spherical code is indeed an 
optimal spherical code, in the sense of maximizing the minimal distance between 
points. For the potential function f{r) — l/r'*, the leading term in the large-s 
asymptotics of the potential energy comes from the minimal distance. If there were 
a spherical code with the same number of points but a larger minimal distance, 
then it would have lower potential energy when s is sufficiently large. 

As we mentioned above, the optimality of sharp configurations as spherical codes 
is not new. Levenshtein proved in |Lev2| that every sharp configuration is a maximal 
spherical code, by using linear programming bounds. For the Eg and Leech minimal 
vectors, this amounts to the solution of the kissing problem by Levenshtein and 
independently by Odlyzko and Sloane in [Levll IOS | ; see |Lev3| for an overview of 
the other cases, except for the 600-cell, for which optimality follows from the bound 
in jBoj (see also |An8| and [eI] for a solution using linear programming bounds). 
Our theorem can thus be viewed as a continuum of optimality results connecting 
the kissing problem and Levenshtein's theorem to the results of |KY1| and |An2j . 
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Certain other known optimality results can also be derived. For example, |KY1| 
and jAn2j prove that the Es and Leech minimal vectors maximize the sum of the 
distances between pairs of points. That is the case /(r) = 2 - r^/^ in TheoremO 
Similarly, taking f(r) = log(4/r) shows that sharp configurations maximize the 
product of the distances between pairs of distinct points. 

When testing whether a configuration is universally optimal, it is helpful to be 
able to restrict to a smaller class of potential functions. Theorem 9b in Wid.. p. 154] 
implies that on each compact subinterval of (0, 4], every completely monotonic func- 
tion on (0, 4] can be approximated uniformly by nonnegative linear combinations 
of the functions r i— > (4 — r)'' with fc e {0, 1, 2, . . . }. For example, 

1 _ /s + fc - 1\ (4-r)'^' 

j,s Z-^ \ k I 4'^+* 
fc>0 ^ ^ 

Thus, a configuration is universally optimal if and only if it is optimal for each of 
the potential functions /(r) = (4 — r)'^ . 

The conclusion of optimality under all completely monotonic potential functions 
is quite strong, and computer experiments suggest that it is rarely true in other 
cases. For example, it provably fails in the case of five points on S^. Schiitte and 
van der Waerden proved in |SchWj that the minimal angle in such a code cannot 
be greater than 7r/2. Their analysis shows that all optimal codes consist of a pair of 
antipodal points with three points on the orthogonal plane. Among such codes, the 
only one that could be universally optimal is the one in which the three points form 
an equilateral triangle. However, when f{r) = (4 — r)^, there is a configuration with 
lower energy: take a single point together with four points equidistant from it and 
forming a square in the opposite hemisphere (the size of the square is chosen so as 
to minimize energy). Thus, the only candidate for a five-point universal optimum 
is not universally optimal. Furthermore, it is not even a local minimum for energy 
when f{r) = (4 — r)^ with k sufficiently large or /(r) — with s sufficiently 
large. 

Leech proved a far stronger theorem in jLeej . He classified the configurations 
on S'2 that are in equilibrium under all force laws, which is a necessary but not 
sufficient condition for universal optimality. His theorem implies that there are no 
universal optima in M.^ beyond those in Table ^ by ruling out all cases except for 
five points (see |Leel p. 89]). We conjecture that for each n > 3 there are only 
finitely many universal optima in R", but we cannot prove it even for n = 4. 

We do not know of any other spherical codes that could be added to Tabled but 
some may well exist. One surprising case in which universal optimality fails is the 
vertices of the regular 24-cell on (equivalently, the D4 root system) . It is not a 
sharp configuration, because it has four inner products besides 1 while it is only a 
spherical 5-design, but one might still hope that it would be universally optimal. It 
appears to be optimal for many inverse power law potentials, but even this weaker 
conclusion is surprisingly subtle: for example, when one simulates 24 randomly 
chosen points on under the potential function /(r) = more than 90% of the 
time one reaches a local optimum with potential energy 334.096 . . . (an algebraic 
number of degree 20), which is only slightly worse than the potential energy 334 of 
the 24-cell. In other words, there is a suboptimal arrangement with a much larger 
basin of attraction. This arrangement belongs to an explicit one-parameter family 
of configurations analyzed in |CCEK| . In that paper, some of them were shown 
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to improve on the 24-cell for certain completely monotonic potential functions, 
including /(r) = (4-r)'' with 8 < fc < 13 and /(r) = e^^*". The 24-cell is therefore 
not universally optimal. 

The remaining regular polytopes (the cubes in R" for n > 3, the dodecahedron 
in R'^, and the 120-cell in R^) are also not universally optimal, for a less interesting 
reason: they are not even optimal spherical codes. Antiprisms improve on cubes, 
and |HI] deals with the other two cases. Thus, a full-dimensional regular polytope 
is universally optimal if and only if all its faces are simplices. 

One might also wonder about the (4,10,1/6) code that is the kissing configu- 
ration of the universally optimal (5, 16, 1/5) code. It consists of the midpoints of 
the edges of a regular simplex. This code is not universally optimal, because for 
the potential functions /(r) = (4 — r)^ with 3 < fc < 6, arranging two regular pen- 
tagons on orthogonal planes yields lower potential energy. For k < 2 their potential 
energies agree; for fc > 7, the (4,10,1/6) code is superior to the two pentagons, 
but we do not know whether it is optimal in these cases. Note that the kissing 
configuration of this (4, 10, 1/6) code is a (3,6, 1/7) code that is not even optimal, 
let alone universally optimal. 

The (4, 10, 1/6) code described above is a spherical 2-design in which only two 
inner products occur between distinct points. It follows that replacing the bound 
2m — 1 with 2rn — 2 in Definition 11.11 would make Theorem 11.21 false . 

Probably the (21, 336, 1/5) kissing configuration of the (22, 891, 1/4) code is not 
universally optimal, but in this case we do not know a better configuration. We 
also do not know whether universal optimality holds for the (20, 105, 1/8) kissing 
configuration of the (21, 162, 1/7) code or the (21, 77, 1/12) kissing configuration of 
the (22, 100, 1/11) code. These configurations are not sharp, so Theorem II . 21 does 
not apply. 

We do not know whether there exist universally optimal spherical codes whose 
universal optimality cannot be proved by our methods. It seems likely that such 
codes exist, but we have no candidates to propose. 

Another important question not addressed by our methods is whether there are 
other local minima for potential energy in the cases listed in Table ^ In certain 
special cases the existence of other minima is trivial; for example, for /(r) = 4 — r, 
a configuration minimizes potential energy if and only if its centroid is the origin, so 
there is a positive-dimensional space of global minima (when iV > 3). In general, 
that is true for /(r) = (4 — r)*^ if fc is small relative to the number of points in 
the configuration. (It was proved in SZj that spherical fc-designs exist on 5""^ for 
all sufficiently large numbers of points. Spherical fc-designs automatically minimize 
energy for /(r) — (4 — r)*^, and because the set of such designs is closed under 
rotation and taking unions they exist in positive-dimensional families once the 
number of points is large enough.) For sufficiently large fc or for inverse power 
laws, the set of local minima appears to be finite, but sometimes there are several. 
When the number of points is large, there can be vast numbers of local minima. 
For example, experiments suggest that for 120 points on under the harmonic 
power law potential, there are more than five thousand distinct local minima, and 
perhaps far more than that. 

It is natural to ask whether the requirement of complete monotonicity could be 
weakened. For example, regular simplices are optimal for all decreasing, convex 
potential functions, although not for all decreasing potential functions. That does 
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not always hold. Specifically, the icosahedron is not even a local minimum for 
the decreasing, convex potential function f{r) ^ ~ 48r + 128, but rather a 
saddle point. One can show using Proposition 14.11 that the global minimum for 
twelve points in is obtained degenerately by letting them coincide in triples at 
the vertices of a tetrahedron. More generally, repeating the vertices of a regular 
simplex in K." is optimal whenever the number of points is a multiple of n+ 1. This 
potential function therefore behaves quite differently from completely monotonic 
potential functions. 

The potential function f{r) = — 48r + 128 from the previous paragraph is 
convex, but r i—> flr"^) is not (so the potential energy is not convex as a function of 
distance, as opposed to distance squared). At the cost of slightly greater complexity 
one can obtain that stronger property: if f{r) = — 7 • 2^'^ ■ ^Jr + 13 • 2^"*, then 
r I— >■ /(r^) is decreasing and convex, as is /, but the icosahedron is not a local 
minimum for potential energy. 

Although it is not enough to assume that the potential function is decreasing 
and convex, in each case only finitely many derivative conditions are needed. For 
each sharp configuration, our methods produce an explicit bound K such that 
optimality holds for a potential function / g C^((0,4]) whenever (— l)'"/'-'^-' — 
for all r E (0,4] and k < K. We will not discuss this issue further, but it is worth 
recording the result: for a sharp configuration with m inner products between 
distinct points, we can take K = 2m; furthermore, if (— l)'^/^'^-' (r) > for all 
r G (0, 4) and k < 2m, then every configuration of the same size and dimension 
with this potential energy is sharp (and the same distances occur between points 
in it as in the original configuration). For antipodal configurations a more refined 
construction can be used to reduce both of these bounds by 1 , but it is implausible 
that they could be substantially reduced using our methods. 

We conclude the introduction with one negative result in arbitrary dimensions, 
namely that there are no universal optima in between the simplex and cross poly- 
tope: 

Proposition 1.4. Ifn+l < N < 2n, then there is no N-point universally optimal 
configuration on 

Proof. An optimal code of this size has the following structure (see, for example. 
Theorem 6.2.1 in |B6Jr| and the remark following it). There are N — n pairwise 
orthogonal subspaces of R" whose span is M", each containing d + 1 points of 
the code if its dimension is d. If the code is universally optimal, then each of 
these subspaces must contain a regular simplex: because of orthogonality, distances 
between points in different subspaces are constant, so within each subspace we must 
have a universally optimal code. For the same reason, every union of some of the 
component simplices must be universally optimal as well. Thus, without loss of 
generality we can assume that N — n + 2. We can also assume that n> A because 
the case of n = 3 and N = 5 was dealt with earlier in this section. 

Suppose the two regular simplices have i + 1 and n ~ i + 1 points. For the 
potential function f{r) = (4 — r)'', the energy of the configuration equals 

^(^ + l)(^2-^^ + {n - i){n - t + 1) (^2 - + {i + l){n - i + l)2''+\ 

For k = 2 that equals An{n + 1) — An/(i{n — z)), from which it follows that the 
potential energy is minimized exactly when i — [n/2\ or i ~ \n/2~\. As fc — )■ oo, 
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the energy is asymptotic to {i + l){n — i + 1)2'^'+^, which is minimized exactly when 
i = lori = n— 1 (and hence not at i — [n/2j or i = \''^/'^^ i because n > 4). Thus, 
universal optimality fails, as asserted. □ 

2. Background 

In this section we briefly review the tools required for the proof of Theorem 1 1.21 



2.1. Hermite interpolation. Recall that Hermite interpolation is a generaliza- 
tion of Lagrange interpolation in which one computes a polynomial that matches 
not only the values but also some derivatives of a function at some specified points. 
More precisely, suppose we are given / G C°°([a,6]), distinct points ti,...,tm € 
[a, b], and positive integers fci, . . . , k^. We wish to find a polynomial of degree less 
than D — ki + k2 + ■ ■ ■ + km such that for 1 < i < m and < k < hi, 

pW(t,) = /W(i.)- 

In other words, p and / agree to order ki at i^. A unique such polynomial always 
exists, because the linear map that takes a polynomial p of degree less than D to 

{p{t,),p'{t,), . . . ,p('=^-l)(ti),p(t2),p'(i2), . . . ,p^''--'\t„,)) 

is injective (and hence surjective as well). 

One fundamental fact about Hermite interpolation is the following remainder 
formula (see Theorem 3.5.1 in ^ p. 68]): 

Lemma 2.1. Under the hypotheses above, for each t G [a,b] there exists ^ € (a, 6) 
such that min(t, ii, . . . , t„i) < < max(t, ti, . . . , tm) and 

fit) - Pit) = ^^it " ti)"' ■■■{*- tm)"-. 

Because the standard proof is short, we reproduce it here for the convenience of 
the reader: 



Proof. Let 

^ ^_m-p{t^__ 

' {t-t^Y^...{t~tmYr.- 

For fixed t G [a, b] \ {ti, . . . , t„i} (the lemma is trivial for t £ {ti, . . . , tm}), consider 
the function 

s ^ /(s) _ p(s) - g{t)is - t^f^ ...{s- tmf^. 

By construction, for each i it vanishes at ti to order fc^, and it also vanishes at so 
it has D + 1 roots in the interval 

[min(i, ii, . . . , i^), max(t, ti, . . . ,tm)]. 

By iterated use of RoUe's theorem, there exists ^ in the interior of this interval at 
which the D-th derivative vanishes. At that point, 

/(^)(0-3(t)i?! = 0, 

which gives the desired formula for g{t). □ 
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This formula will play a fundamental role later in the paper, but we will also need 
a stronger result. Recall that a function is absolutely monotonic on an interval if it is 
C°° and it and all its derivatives are nonnegative on that interval (see jWidl p. 144]). 
Unfortunately, the terms "completely monotonic" and "absolutely monotonic" are 
easy to confuse, but they are standard terms. As a mnemonic, associate "absolutely 
monotonic" with "absolute value" to remember which one has no signs. We call a 
function strictly absolutely monotonic on an interval if it and all its derivatives are 
nonnegative on the interval and strictly positive on its interior. 

Proposition 2.2. Under the hypotheses above, suppose further that f is absolutely 
monotonic on (a, 6). Then 

fit) -Pit) 
{t - ti)^^ ...{t- t^)^^ 

is also absolutely monotonic on (a, 6). // / is strictly absolutely monotonic, then 

fit) -Pit) 

{t - ...{t- i„o^-- 

is as well. 

Of course, when we write quotients such as this one, we define them by continuity 
when t = ti for some i so that they become C°° functions. It seems plausible that 
Proposition 12.21 is known, but we do not know where to find it in the literature. 
The hypothesis that ti,. . .,tm lie in the interval [a,b] is crucial (it is part of the 
setup above, but it is worth reiterating it here). 

Before proving Proposition l2.2l it is convenient to introduce systematic notation 
for Hermite interpolations. Given a polynomial g with deg{g) > 1, let H{f,g) 
denote the polynomial of degree less than deg{g) that agrees with / at each root 
of g to the order of that root. (If / is a polynomial, then H{f,g) is its remainder 
modulo g in polynomial division.) In other words, if g{t) vanishes to order k at 
t = s, then 

f{t)-H{f,g){t) = 0{{t-s)'^) 

as t — > s. It follows that the function Qif,g) defined by 

fit)-Hif,g)it) 
Qif,9)it) = — 

extends to a C°° function at the roots of g. Proposition 12 . 21 is the assertion that if 
g{t) = {t — ti)^'^ . . .(t — tm)^"^ : then Qif,g) is absolutely monotonic. 

Proof. The proof rests on two key properties. The first is that 

Qif,9i92) = QiQif,9i),92), 
which follows from the uniqueness of the Hermite interpolation: 
nfntf \ \ f-Hif,gi)-g,H{Q{f,g,),g2) 

QiQij,gi),92) = , 

9i92 

but iJ(/, gif/2) is the unique polynomial of degree less than deg(5i) + deg{g2) 
such that (/ — H{f,gig2))/igig2) extends to a C°° function everywhere. Thus, 
Hif, g,g2) = Hif, g,) + giH{Q{f, g^), 52) and Q{f, g^g^) = g(Q(/, ffi), 52). 
The second property is that if gviit) = it — sq)", then 

<y(/,5o)(so) = — i — • 
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This follows immediately from the Taylor expansion. 
We must show that for n > and t £ (a, b), 

Q(/,5)("ni)>0. 

When n = 0, that follows from Lemma ITTl which asserts the existence of ^ g (a, b) 
(depending on t) such that 

deg(5)! 

For larger n, given sq € (a, 6), define go{t) ~ {t — so)"- Then 

= Q(Q(/,5),5o)(so) = 0(/,5<?o)(so) > 0. 

This completes the proof when / is absolutely monotonic. The case of strictly 
absolutely monotonic functions works exactly the same way. □ 

It follows immediately from continuity that the statement of Proposition lT^ also 
holds when the open interval (a, b) is replaced with a half-open or closed interval. 

2.2. Positive-definite kernels. A continuous function K : 5"^^ x 5*"^^ ^ M is a 
positive- definite kernel if for all k and all xi, . . . , £ 5"^^, the fc x fc matrix whose 
I, J entry is K{xi, Xj) is positive semidefinite. In other words, for alHi, . . . , tk € M, 

titjK{xi,Xj) > 0. 

l<i,j<k 

The most important special case is when ti — ■ ■ ■ ^ t^ = I', for every finite subset 
C C S"'-!, 

K{x,y)>0. 

x,y£C 

Positive-definite kernels K such that K{x, y) depends only on the distance |a; — ?/| 
between x and y will be particularly important. There is a simple representation- 
theoretic construction of such kernels, which Schoenberg proved gives all of them 
(see |Schp . As a unitary representation of 0(n), the Hilbert space L'^{S^~^) breaks 
up as a completed orthogonal direct sum of infinitely many finite-dimensional rep- 
resentations: 



jn— 1 



where Vi is the space of spherical harmonics of degree £ (i.e., restrictions to S""^ of 
homogeneous polynomials on M" of degree £ that are in the kernel of the Euclidean 
Laplacian). Note that spherical harmonics of different degrees are orthogonal, which 
will play a useful role later. It is also important to know that every polynomial on 
M" has the same restriction to S"'~^ as some unique linear combination of spherical 
harmonics. For more details on spherical harmonics, see §2 of Chapter IV in |StWj . 

For each £ > and each x e S"""^ , there is a unique reproducing kernel evg^x G Vi, 
defined by requiring that for all / G V«, 

{Lg^I,x) = f{x). 
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(We use Hermitian forms that are conjugate-linear in the second variable.) To 
construct evt^x explicitly, choose an orthonormal basis ei^i, . . . ,ei^d\xnVi of ^ ^-^^'^ 
define 

evf,x(y) = ei,i(x)ets{v) H \- e^.dimyf (a;)e/^,dimy^(2;)• 
Define the function Ki>: S*""^ x 5"-^ ^ C by 

Ki{x,y) = (ev£^^,ev£_j^) = ev£^^(y). 

It is straightforward to check that evt^Ax = ev^ ajoA^^ for A e 0{n). Because 
0{n) acts distance-transitively on S*"^^ (i.e., if |u — — \x — y\, then there exists 
A g 0{n) such that Au = x and Av = y), Ki{x^y) depends only on the distance 
between x and y. That implies that it is real- valued, since 



Ki{x,y) = Ki{y,x) 

follows immediately from the definition while K^{x^y) = Ke{y,x) since \x — y\ 

\y-x\. 

The key property of Ki is that it is a positive-definite kernel: 

2 



^ UtjKi{xi,Xj) = 



l<i,j<k 



l<i<k 



> 0. 



All convergent nonnegative infinite linear combinations of these functions are still 
positive-definite kernels, and Schoenberg proved that those are the only continuous 
positive-definite kernels that depend only on the distance between points. 

One can compute Ki explicitly. Because Kg(x^y) depends only on the distance 
between x and y, we can write Ki{x,y) = Cg{{x,y)) for some function Cg. To 
compute Cf, we use the orthogonality between spherical harmonics of different 
degrees. If we fix a point on the sphere and project orthogonally onto the line 
through the origin and that point, then the surface measure on the sphere projects 
to a constant times the measure 

(l_t2)(n-3)/2^^ 

on the interval [—1,1], and the orthogonality property amounts to saying that 
Co, Ci, . . . are orthogonal polynomials with respect to that measure, with Ci having 
degree i. That determines these functions up to multiplication by a positive scalar 
(positive because Ci(l) > 0), and pinning down that factor is irrelevant for our 
purposes. 

These orthogonal polynomials are known as the ultraspherical (or Gegenbauer) 
polynomials. The standard notation is C^{t) for the degree i polynomial orthogonal 
with respect to (l-t2)^-i/2 [-1, 1] (for 5*"-^ A = n/2-1). These polynomials 
are normalized so that Cq (t) = 1 and C^(i) — 2\t, and they satisfy the recurrence 

for i > 2. See |AAR,I p. 303] for more details on ultraspherical polynomials, as 
well as Chapter 9 of jAARj for spherical harmonics. We always assume A > 
(equivalently, n > 2), because the one-dimensional case is trivial and its ultra- 
spherical polynomials are poorly normalized: above degree they have negative 

— 1/2 

leading coefficients, and Ci is not a positive-definite kernel. 

Whenever we refer to the ultraspherical coefficients of a function, we mean its 
coefficients in terms of ultraspherical polynomials (where the parameter A is implicit 
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in the context). A positive-definite function will be a function whose ultraspherical 
coefficients are nonnegative. 

One fundamental fact about positive-definite functions is that they are closed 
under taking products. Equivalently, the product of two ultraspherical polynomials 
is a nonnegative linear combination of ultraspherical polynomials (for A > 0). By 
orthogonality, proving that amounts to showing that 

(2.1) / C,{t)Cj{t)Ck{t){l-t^Y"-^'>/^dt>0 



1 

'n— 1 



for all i,j,k > 0. Let /i denote the surface measure on 5" . Then using the 
expansion 

dim Vi 

Ki{x,y) ^ ^ et.m{x)ei^m{y) 

m—l 

in terms of a choice of orthonormal basis e^^i, . . . , e^^dim Vf for each Vi shows that 
l|2.1|) holds if and only if 

dim Vi dim dim Vk „ 

X! X! X! <^tA^)(^j,b{^)'^kA^) dfJ'ix) Ci^aiy) ej^b{y) ek,c{y) > 

a=l b=l c=\ •' 

for some (equivalently, all) y € S'^~^ . Integrating over y yields 

2 

e»,a(a;)ej,{,(a;)efe,c(a;)(iAi(a;) , 



dim Vi dim Vj dim V^. 

E E E 

a=l b=l c=l 

which is visibly nonnegative. 

This result also follows from the Schur product theorem: the set of positive 
semidefinite matrices of a given size is closed under taking Hadamard products 
(i.e., taking the product entry by entry). See Theorem 7.5.3 in |HJ[ p. 458]. That 
approach leads to the proof of Schoenberg's characterization of positive-definite 
kernels. Suppose [x,y) ^ f{{x,y)) is a positive-definite kernel. By the Schur 
product theorem, the same is true for {x,y) i-^ f{{x,y))Ci{{x,y)) for each i. It 
follows that the integral of this function over {x,y) G S'^~^ x S'^~^ is nonnegative, 
and hence for fixed y the integral over x £ S"""^ is nonnegative (because it is 
independent of y). In other words, the i-th ultraspherical coefficient of / must be 
nonnegative. 

We call a polynomial strictly positive definite if all its ultraspherical coefficients 
are strictly positive (up to its degree, of course). The product of strictly positive- 
definite polynomials is strictly positive definite; the proof rests on the fact that 
in the expansion of C^Cj, the coefficient of C^_^j is positive (because all these 
polynomials have positive leading coefficients). 

One useful application of ultraspherical polynomials is the following simple test 
for spherical design strength (Theorem 5.5 in |UGS| 1: a configuration C C 5*"^^ is 
a spherical M-design if and only if 

(2.2) J2 c:'^~\{^.y)) = ^ 

x,yeC 

for 1 < i < M . That property depends only on the distance distribution of C (i.e, 
which distances occur between points in C and the multiplicities of those distances). 
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To prove this characterization of designs, observe that the identity 

2 



x.y^C x.y^C 



shows that (|2.2() holds if and only if all of the degree i spherical harmonics sum to 
zero over C. 

3. Orthogonal polynomials 

The proof of Theorem 11.21 involves expansions as nonnegative linear combina- 
tions of orthogonal polynomials. In this section we prove a theorem about such 
expansions that will play a fundamental role later in the paper. We prove it in 
greater generality than is required for Theorem II. 21 because it is no more difficult 
and the proof is of independent interest. 

Let ^ be any Borel measure on R such that all polynomials are integrable with 
respect to ^ and for all polynomials p, 

p{tf dn{t) > 

if p is not identically zero. Equivalently, for all polynomials p such that p(t) > 
for all t but p is not identically zero, 

p{t)dn{t) > 0. 

(Every such polynomial is a sum of two squares of polynomials; see for example 
Problem 44 in Part VI of jPS| and its solution.) 

Let po,pi, ... be the monic orthogonal polynomials for /x, with deg(pi) = i. In 
other words, for i ^ j 

J p,{t)pj{t)dn{t) = Q. 

It is known that for each a G M, the polynomial Pn + apn-i has n distinct real 
roots, which are interlaced with the roots of (see Theorem 3.3.4 in JSz|). 

Theorem 3.1. Let a be any real number, and let 

ri < r2 < ■ ■ ■ < r„ 

be the roots of pn + apn-i- Then for k < n, 



X{{t-n) 



i=l 

has positive coefficients in terms of po(t) , pi{t) , . . . ,pk(t). 

The proof is conceptually simple, but it involves some technicalities. We begin 
with the simplest case, namely k = n — I: 

Proposition 3.2. Let a be any real number, and let r be the largest root of pn + 
apn-i- Then 

Pn{t) +apn-iit) 
t-r 

has positive coefficients in terms of po{t),pi(t), . . . ,p„_i(i). 
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Proposition 13 . 21 follows from the ChristofFel-Darboux formula (Theorem 5.2.4 in 
|AAR,j ) . but we prove it directly to illustrate the technique we will apply to the 
general case. 



Proof. Define co, . . . , c„_i so that 

Pn(t) + apn-i{t) _ 



t-r ^ 



CiPiit). 



For £ < n — 1, it follows from orthogonality that 

{Pn[t) + apn-l{t)) d^.{t) = 0, 

J t — r 

because 

Peit)-pe{r) 
t-r 

is a polynomial of degree £ — 1 and hence orthogonal to both Pn{t) and pn-i{t). 
Thus, 

Pn{t) + apn^l{t) f Pnjt) + aPn-ljt) ^ , . 



t-r 

In terms of the coefficients cq, . . . , c„_i, 

Q J Pe{t)'^ d^{t) coPfXr) J dfi{t). 

Because the roots of Pn + apn-i and Pn-i are interlaced, r is greater than 
the largest root of p„_i and hence greater than the largest root of p£. It follows 
that p£{r) > 0, and thus cq, . . . ,c„_i all have the same sign. Comparing leading 
coefficients shows that c„_i = 1, so every coefficient must be positive. □ 

There is a natural generalization of this proof to the case of arbitrary k in 
Theorem l3.1l The key idea is to use orthogonality with respect to a signed measure, 
because ior k < n, 

k 

U^t-r,) 

1=1 

is the monic orthogonal polynomial of degree k for the signed measure 

{t - rk+i) . . . (t - r„) 

assuming there is a unique such polynomial, as we prove below in Lemma 13.51 and 
Lemma 13.71 To verify that, we simply need to show that it is orthogonal to all 
polynomials of degree less than k, which is equivalent with the orthogonality of 
Pnit) + apn-i(t) to all such polynomials with respect to dfi{t). (For k = n the 
conclusion is false unless a = 0.) More generally, we have the following lemma: 

Lemma 3.3. Suppose v is a signed measure that has monic orthogonal polynomials 
go, 9i, • ■ • , <lN+i, where deg{qi) — i. For i < N , if qi{r) ^ and [t — r) dv{t) has a 
unique degree i monic orthogonal polynomial, then it equals 

qi+ijt) + a»gt(<) 
t-r 

where ai is chosen so that qi^i{r) + aiqi{r) = 0. 
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Proof. The orthogonality of this polynomial with respect to (t — r) di'{t) amounts 
to the orthogonality with respect to di>(t) of qi+i{t) + aiqi{t) to all polynomials of 



One might hope to imitate the proof of ProDOsition l3.2l as one removes successive 
roots of Pn + otPn-i- In fact, this approach works, but it is not obvious that it 
works, because of a fundamental technical difficulty. As soon as one introduces 
linear factors such as t — r into the measure, it is no longer a positive measure but 
rather a signed measure. Many of the basic properties of orthogonal polynomials 
fail in the case of a signed measure. Monic orthogonal polynomials may not exist 
or be unique, and the roots of pi and Pi+i may not be interlaced. In addition, there 
is an even more dramatic problem: the proof of Proposition 13.21 depends on the 
positivity of 



neither of which is generally positive for a signed measure. 

Fortunately, the measures that actually arise in the proof will be considerably 
better behaved than a typical signed measure. In particular, they will all have the 
following property for some N: 

Definition 3.4. Let v he a signed Borel measure on R with respect to which all 
polynomials are integrable. We say v is positive definite up to degree N if for all 
polynomials / such that deg(/) < iV, 



with equality only if / is identically zero. 

If V is positive definite up to degree iV, then many of the usual properties hold 
for orthogonal polynomials of degrees up to + 1. Lemma l3 . 51 covers the properties 
we will require. 

Lemma 3.5. Let v he positive definite up to degree N. Then there are unique 
monic polynomials qa, 51, ... , gjv+i such that dcg{qi) = i for each i and 



for i j- For each i, qi has i distinct real roots, and the roots of qi and are 
interlaced. 

The standard proofs for orthogonal polynomials from Chapter 5 of |AAR| can 
be adapted, if one uses the trick of writing a nonnegative polynomial as a sum of 
two squares. Here we use instead a particularly elegant proof from ISIl p. 14]: 

Proof. The orthogonal polynomials for v are obtained by applying Gram-Schmidt 
orthogonalization to l,t, . . . , t^+^: 



degree at most i ~ 1. 



□ 




and 
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which makes sense for i < A'^ + 1 because the norm of qj in the denominator is 
always nonzero. They are unique because the quotients of integrals are the only 
coefficients in such an expansion that yield orthogonality. 

The next step is to prove the three-term recurrence relation. The polynomial 
qi{t) — tqi^i{t) is orthogonal to all polynomials of degree less than i — 2, and hence 
there exist constants and bi such that 

q,{t) = {t + ai)qi_i{t) + b,qi^2{t)- 

Multiplying by qi-2{i) and integrating yields 

h, j q,_2{tfdv{t) = - j q,^i{t)tq,^2{t)dv{t), 

by orthogonality. The polynomials tqi^2{t) and qi^i{t) differ by a polynomial of 
degree at most i — 2, and hence 

g,_i(t)%_2(0 du{t)^ j q,^i{tf dv{t). 

Combining the last two equations shows that 6^ < 0. 

We now prove by induction that the roots of qi and qi-i are real and interlaced. 
The base case of i = 1 is trivial. For the induction step, suppose the roots of qi-i 
and qi-2 are real and interlaced. For each root r of gi-i, 

q,{r) = hq^^2{r), 

by the three-term recurrence relation. By hypothesis, qi^2 alternates in sign at the 
roots of qi-i. Because bi is negative, qi and qi-2 have opposite signs at the roots of 
qi-i- That implies that qi must have a root between each pair of consecutive roots 
of qi-i. Because qi(t) and qi-2{t) have the same sign when \t\ is sufficiently large, 
qi must also have a root greater than every root of qi-i and a root less than every 
root of qi-i. All i roots of qi are now accounted for, so all the roots of qi are real 
and interlaced with those of qi-i- That completes the proof. □ 

The following minor variant of Gauss- Jacobi quadrature (see Theorem 5.3.2 in 
jAARj ) will prove crucial for understanding the signed measures that arise in our 
proof. 

Lemma 3.6. Let a be any real number, and let 

ri < r2 < • • • < r„ 

be the roots of Pn + apn-i- Then there are positive numbers Ai, . . . , A„ such that 
for every polynomial f of degree at most 2n — 2, 

1=1 

Proof. There exist coefficients Ai, . . . , A„ such that 



^ n 



whenever deg(/) < n, because a polynomial of degree less than n is completely 
determined by its values at ri r,i and the map taking these values to the integral 
is linear. 
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For deg(/) < 2n — 2, there exist polynomials g and h satisfying 

/ = [Pn + CtPn-l)g + h 

with deg(g) <n — 2 and deg(/i) < n. By orthogonality, 



and f{ri) — h{ri) for each i because p„(ri) + Q;p„_i(ri) = by definition. It follows 
that 



/(t)d/x(i) = ^AJ(r,) 



i=l 

holds whenever deg(/) < 2n — 2. 

All that remains is to prove positivity for the coefficients. For fixed i, let 

fit)^]l{t-r,r. 

Then 

from which it follows that > 0. □ 

Let ri < • • • < r„ be the roots of p„ + apn-i, and for < j < n define the 
measure /ij by 

dfJ'jit) = ni'^n-i - t)dn{t) 

(of course /iq = m)- 

Lemma 3.7. For Q < j < n — \. the measure fij is positive definite up to degree 
n-j-1. 

Proof. If deg(/) < n — 1 — j/2, then by Lemma [3. 61 

/j-l n j-1 

f{t? Y[{rn^^ ~ t) dm = J2 ^di^^)^ Uirn-i - n) > 0; 

i=0 i=l 1=0 

equality holds if and only if / vanishes at ri , . . . , r„_j , which is impossible for a 
polynomial of degree at most n — j — 1 unless it vanishes identically. □ 

Let qj^i be the monic orthogonal polynomial of degree i for fij. (Note that 
Qo,i — Pi-) The polynomial qj^i is simplest when i ^ n — j, in which case we have 

<lj,n-jit) = {t~ri)...{t- r„_j): 

as pointed out earlier. That implies that for i < n~ j, the largest root of qj^i is less 
than rn~j, because the roots are interlaced. 

Equivalently, for i < n — j, the largest root of qj-i.i is less than rn-j+l^ so 
qj-i^i{rn-j+i) ^ 0. Hence Lemma implies that for i < n — j, there are constants 



Uj^i such that 



„ /,N _ gj-i,*+i(i) + a],iqj-iAt) 



Lemma 3.8. For 1 < j < n and i < n — j , the polynomial qj i is a positive linear 
combination of qj-i^, . . . , qj-iA- 
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Proof. Define co, . . . , Ci so that 

i 

i=0 

We argue as in the proof of Proposition 13. 21 For £ < i, 

(3.1) / fa-^i..+i(t) + a,,q,_,,{t)) '^'-'■'^^l I l^-^Arn-. + l) ^ ^ 

by orthogonahty, because 

is a polynomial of degree I — 1, which is less than i. It now follows from 

_ gj-i,»+i(0 +aj,»gj-i,»(0 

^ ' n—j + l 

and H3.1|l that 
Thus, 

^2 



ci J qj^iAt) d^j^i{t) = caqj-iAr-n-j+i) J d^j^i{t). 

Because i < i < n — j , both integrals are positive by Lemma [3. 71 The largest root 
of qj-i,e is less than so 

gj_i/(r„_j+i) > 0. 

Thus, Co, . . . , c,; all have the same sign, which is positive because = 1. □ 
Theorem 13. II follows from applying Lemma l3 . 81 repeatedly starting with qk,n-k- 

4. Linear programming bounds 

Theorem 11.21 is deduced from the following proposition, which originates in 
and is the key technique in |Y| lEYTI IE721 IXiiTl IXn2] (here we give a somewhat 
different proof from the one in jYj, along the lines of |An2| l: 

Proposition 4.1. Let f : (0,4] —> R be any function. Suppose h: [—1, 1] — > M is a 
polynomial such that 

h{t) < f{2 - 2t) 

for all t e [—1, 1), and suppose there are nonnegative coefficients ao, . . . ,ad such 
that h has the expansion 

h{t)^Y.a.C-''-\t) 

i=0 

in terms of ultraspherical polynomials. Then every set of N points on S"'~^ has 
potential energy at least 

N^ao - Nh{l) 
with respect to the potential function f . 
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Fundamentally, this proposition is a generalization of the linear programming 
bounds for spherical codes due independently to Kabatiansky and Levenshtein in 
|KL| and Delsarte, Goethals, and Seidel in 'DtI_SI (see also Chapter 9 in ICS ). As 
is pointed out in Y , one can derive those bounds for spherical codes with minimal 
angle from Proposition by setting 



(Of course, / takes values in M U {oo}, but that is not a problem.) The potential 
energy is then for a spherical code with minimal angle at least and oo otherwise; 
Proposition 14.11 implies that no such code can exist if > h{l)/aQ. 

Choosing the optimal function h amounts to solving an infinite-dimensional 
linear programming problem: the potential energy bound is a linear functional 
of /i, and the only restrictions on h are the linear constraints on its values and 
ultraspherical coefficients. As in the case of the traditional linear programming 
bounds, it is not hard to approximate the optimum numerically (by solving a 
finite-dimensional linear programming problem in which one imposes the constraint 
h{t) < /(2 — 2t) for only finitely many values of t). Even though the bound is usu- 
ally not sharp, it is often quite close. For example, for twenty points on and 
the Coulomb potential function /(r) = the minimal-energy configuration, 

which incidentally is not the vertices of a regular dodecahedron, appears to have 
potential energy 301.763 .... The polynomial 



proves a lower bound of 301.204. It is not the best possible choice of h, but we have 
found no polynomial that beats 301.5. 

What is remarkable is that in the case of the sharp arrangements, for every 
completely monotonic potential function there is an auxiliary function that proves 
a sharp bound, as we show in the proof of Theorcm ll.2l 

Proof of Proposition Suppose C C S""^^ has cardinality A^. By assumption. 




oo if t > cos 9, and 
otherwise. 



h{t) 



0.8729Co/^(i) + 0.634Cy^(<) + 0.425C2/^(i) -I- 0.258Cl^^{t) 
+ 0.135Cy^(i) + 0.0569C5/^(i) + 0.016Ce/^(t) 



E fi\--yn> 



because \x — = 



2 — 2(x,y). On the other hand, 



E 



Nh{i)+ E Hi^^y)) 



x,y&C 



d 



i=0 x,yeC 



Because C[ 



,n/2-l 



is a positive-definite kernel (see Subsection 12. 2|l 



x,yeC 
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It follows that 

h{{x,y))>-Nh{l) + ao ^ C';;/^^\{x,y)) = N^ao - Nh{l), 

as desired. □ 

The bound proved by an auxiliary polynomial h is sharp for a configuration C 
and potential function / if and only if two conditions hold: h{t) must equal f(2 — 2t) 
at every inner product t that occurs between distinct points in C, and whenever the 
ultraspherical coefficient ai is positive with i > 0, we must have 

Y: Cf'-\{x,y))=0. 

x,yeC 

In particular, if h is strictly positive definite, then C must be a spherical deg(/i)- 
design, by Theorem 5.5 in jDGS| . 

One can easily generalize Proposition 14.11 to deal with the case of unequally 
charged particles (this was pointed out in Theorem 1 of (Y|). Suppose we have N 
real numbers ci, . . . , cat, which we refer to as charges; they must all have the same 
sign. Given a potential function / and an ordered TV-tuple {xi, . . . , xn) & (S"""^)^, 
define the potential energy to be 

Y C,Cjf{\x^-Xj\'^). 
l<'i,j<N,i^j 

Under the conditions in Proposition 14.11 no such configuration can have potential 
energy less than 

We know of few beautiful configurations with unequal charges and none for which 
linear programming bounds are sharp, but they may well exist. 

5. Choosing auxiliary functions 

In this section, we describe an explicit choice for the auxiliary function h in 
Proposition ^2 it is analogous to the constructions using the Christoffel-Darboux 
formula in |KL1 IMRRWj . We will not require this function later in the paper, 
but it provides a simple way to apply Proposition 14. II and the techniques used in 
analyzing it will be important in the proof of Theorem 1 1.21 

Let / : (0, 4] ^ M be completely monotonic. From the proof of Proposition l4.1l we 
know that it is more natural to look at a{t) — /(2 — 2t), because |a;— — 2—2{x, y). 
The function a is absolutely monotonic on [—1, 1) because / is completely monotonic 
on (0, 4], and a is strictly absolutely monotonic if and only if / is strictly completely 
monotonic. 

Our construction of h requires three inputs: the dimension n — 1 of the sphere 
S"""^, a natural number m > 1, and a real number a > 0. Let 

ti < ■ ■ ■ < t,n 

be the roots of the polynomial C'm^ ^ + ceCl^l^i ^- We require that ti > —1 
and thus {ti, . . . ,tm} C [—1, 1); by Theorem 3.3.4 in .S.z., that amounts to a < 
—Cm^^^{—l)/Cl^^i^{—l) (which is positive because C"^^~"^(— 1) has sign (— 1)*). 
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Let h{t) be the Hermite interpolating polynomial that agrees with a{t) to order 2 
at each ti (i.e., h{ti) — a{ti) and h'{ti) — a'{ti)). The idea of constructing auxiliary 
functions via Hermite interpolation originates in [Y) . 

Lemma 5.1. For all t £ [—1, 1), 

h{t) < a{t). 

Proof. By Lemma [2. II there exists a point ^ such that 
min(i,ti, . . . ,t™) < ^ < max(i,ti, 

t )^ 

□ 

The last requirement before we can apply Proposition 14.11 namely that h must 
be a positive-definite function, is more subtle. Let 

m 

F{t) = Cll^-\t) + c.cTS^\t) = - ^0- 

1=1 

In the notation introduced before Proposition 12.21 h ~ H{a,F^). We will show 
that F^ has a strong property that we refer to as conductivity: 

Definition 5.2. A nonconstant polynomial g with all its roots in [—1,1) is con- 
ductive if for all absolutely monotonic functions a on [—1,1), H{a,g) is positive 
definite. It is strictly conductive if it is conductive and for all strictly absolutely 
monotonic a, H{a,g) is strictly positive definite of degree deg{g) — 1. 

Lemma 5.3. // gi and g2 are conductive and gi is positive definite, then gig2 
is conductive. If gi is conductive and strictly positive definite and g2 is strictly 
conductive, then gig2 is strictly conductive. 

Proof. As noted in the proof of Proposition 12.21 for every absolutely monotonic 
function a on [—1, 1), 

H{a, gig2) = H{a, gi) + giH{Q{a, gi), 52)- 

The lemma now follows from Proposition 12 . 21 and the fact that the set of positive- 
definite (or strictly positive-definite) functions is closed under taking products. □ 

For r S [—1, 1), let ir denote the linear polynomial £r{t) — t — r. Clearly £r is 
conductive, and even strictly conductive if r ^ — 1, because H(a, £r) is the constant 
polynomial a(r), which is nonnegative. Lemma |5.3I implies that if is conductive 
and positive definite, then g£r is conductive. 

Consider the partial products 

f[(.t-t.) 

1=1 

for j < m. By Theorem l3.ll this product is positive definite for j < m (even strictly 
positive definite), and for j = m it is positive definite because a > 0. It follows 



and 



a{t)-h{t) = —^{t~t,)\..{t 



(2m)! 

The right side is nonnegative, and hence a(t) > hit) 
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from Lemma 15.31 and the remarks in the previous paragraph that each product is 
conductive. Thus, 

i=l 

is conductive, so Lemma 15.31 imphes that its square is as well. Because h = 
H{a,F^), we conclude that h is positive definite, as desired. 

These auxiliary polynomials are rarely optimal. For example, consider the case 
n = 3, N = 20, and /(r) = that was discussed in Sectional Numerical 

calculations suggest that the best bound is obtained by taking m = 3 and a = 
0.752718117 . . . (an algebraic number of degree 22). That yields a lower bound of 
299.708 . . . , which is worse than the bound of 301.204 derived in Section^] However, 
it is reasonably close, especially given how simple and systematic the construction 
of h was. 

The connection between our construction and those used in |MR,RWj and |KL| 
may not be immediately apparent. Those constructions are equivalent to using the 
polynomial 

m— 1 
i=l 

in the linear programming bounds for spherical codes with minimal angle 9 satis- 
fying cos 6 — tm ■ The fact that 

m — 1 

i=l 

is positive definite is proved in those papers using the Christoffel-Darboux formula. 
Unlike in our construction, a can be arbitrarily large. 

One can generalize the construction from this section as follows. Let poiPij • ■ ■ 
be any family of orthogonal polynomials such that each is positive definite. Given 
m and a, choose ti, . . . ,tm to be the roots of pm + apm~i- The construction and 
proof are essentially the same as before. The proof of Theorem II . 21 will be based 
on this approach. 

6. Proof of the main theorem 

Let / : (0, 4] ^ R be completely monotonic, define a{t) = /(2— 2i) as in Section|Sl 
and let C C 5*"^^ be a sharp arrangement with |C| = iV. To prove Theorem 11.21 
we will construct an auxiliary polynomial h that satisfies the hypotheses of Propo- 
sition ^3 a-nd proves a sharp bound. The case when C is the set of vertices of the 
600-cell is qualitatively more difficult than the other cases, so for the time being we 
assume that we are in the sharp case. (We deal with the 600-cell in Section [7|) 

Suppose that the possible inner products that occur between distinct points in 
C are ti, . . . , tm, ordered so that 

-1 < ii < • • • < i„j < 1. 

Let h{t) be the Hermite interpolating polynomial that agrees with a{t) to order 2 
at each ti. 

When ti = —1, it might seem more natural to interpolate only to first order at 
that point. The purpose of second-order interpolation is to avoid sign changes in 
h{t) — a(t) when t = U, and that is not a concern when t — —1. In fact, first- 
order interpolation works there, but we will omit the details here, because the only 
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advantage we see in it is that it leads to a slightly improved bound on the number of 
derivative sign conditions one must impose on the potential function (it decreases 
by 1 the bound K mentioned in Section^. See Section |S1 for the details of that 
approach in a context in which it is more useful. 

Lemma 6.1. For all t e [—1, 1), 

h{t) < a{t). 

The proof is identical to that of Lemma IFITI Thus, the first hypothesis of Propo- 
sition 14.11 is satisfied. What remains to be shown is that h is & positive-definite 
function and that the bound is sharp. (The bound derived from this function h 
is not sharp for the 600-cell, which is why that case is more subtle and we have 
postponed it until later.) 

The sharpness of the lower bound is simple, because it follows from the definition 
of a sharp arrangement that C is a spherical deg(/i)-design. Thus, 

x.yeC 

whenever < z < deg{h); in fact, for each fixed y, the polynomial C"^^ ^{{x,y)) 
has the same average over C as over the entire sphere 5*""^, and it integrates 
to over the sphere because it is a positive-degree spherical harmonic and thus 
orthogonal to the constant functions. Because h agrees with a at the inner products 
ti, . . . ,tm from C, the conditions for a sharp bound that were listed after the proof 
of Proposition ^3 are satisfied. 

The proof that h is positive definite is analogous to the proof in Section El but 
it will involve more elaborate lemmas. Define 

m 
i=l 

Then h = i?(a, F^), and we will prove that F'^ is conductive. In fact, we will show 
that F"^ is strictly conductive, which will be important in the proof of uniqueness. 

Lemma 6.2. The Junction F is strictly positive definite. 

This lemma is a standard fact in the theory of linear programming bounds. We 
prove it as in pl| . 

Proof. The leading coefficient of F in terms of powers of t is positive, which implies 
that its leading ultraspherical coefficient is also positive. For the others, we apply 
orthogonality to see that the z-th ultraspherical coefficient of F equals a positive 
constant (depending on i) times 

/ F{{x,y))C;^'-'{{x,y))dx, 

where y is an arbitrary point on S*"^^ and of course dx denotes surface measure on 
S"~\ Choose yeC. 

Because we know that the leading ultraspherical coefficient in F is positive, we 
can take i < deg{F). Thus, deg(F) + i < 2deg(F) — 1. Because C is a sharp 
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configuration, it is a spherical (2deg(i^) — l)-design. It follows that the integral 
above equals a positive constant times 

By the construction of each term in the sum vanishes except for x = y, so the 
sum equals i^(l)C"^^^^ (1). Both factors are positive, as desired. □ 

In order to deal with the partial products 

j 

1=1 

we must express F in terms of orthogonal polynomials. Recall that the ultra- 
spherical polynomials C"^^ ^ are orthogonal with respect to the measure 

(l_i2)(„-3)/2^^ 

on [—1, 1]. Call that measure dfi(t). Let pQ,pi, . . . denote the monic orthogonal 
polynomials with respect to (1 — t)dij.{t). These polynomials are a special case 
of Jacobi polynomials, but we will require only one fact about them: they are 
nonnegative linear combinations of the ultraspherical polynomials C"^^ ^ . That 
follows from Lemma 13.31 and Proposition 13. 21 

Lemma 6.3. There exists a constant a such that F ~ pm + ctpm-i- 

Proof. We simply need to show that F is orthogonal to all polynomials of degree at 
most m—2 with respect to the measure (l — t) dfi(t). That is equivalent to showing 
that (1 — t)F{t) is orthogonal to all such polynomials with respect to d^{t). 

Let p be any polynomial of degree at most m—2. Because C is a sharp config- 
uration, it must be a spherical (2to — l)-design. It follows as in the previous proof 
that 

j {l-t)F{t)p{t) df,{t) 
equals a positive linear combination of 

(1 - . . . , (1 - i,„)F(i„)p(t„0, (1 - 

Each of these vanishes because F vanishes at ii, . . . , t^. Thus, 

J F{t)p{t) (1 - t)dii{t) = 0, 

as desired. □ 
Combining Lemma l6 .31 with Theorem 13 . II shows that for j < to, the polynomial 

f[it-u) 

i=l 

is strictly positive definite, and the case j = to is Lemma |6.2I Now the proof that 
F and F^ are conductive concludes as in Section|5l by using Lemma lS^l reoeatedlv. 

We must still prove the additional uniqueness results when / is strictly com- 
pletely monotonic. For that, we use the following lemma: 

Lemma 6.4. If a satisfies a^'^-'(t) > for all k > and t e (—1, 1), then a{t) — h{t) 
has at most Aeg{K) + 1 roots in [—1, 1), counted with multiplicity. 
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Proof. By Rolle's theorem, a{t) — h{t) has at most one more root than a'{t) — 
h'{t). Repeated differentiation reduces to the case in which deg(/i) — 0, which is 
trivial. □ 

Thus, the only roots of a{t) — h{t) in [—1, 1) are at t = U for some i. It follows 
from the proof of Proposition ^3 that if C is any other potential energy minimum 
with |C'| = |C|, then all inner products between distinct points of C occur among 
ti, . . . ,tm- Thus C is a spherical code with the same parameters as C, and each 
sharp configuration listed in Table ^ except on the last line is known to be the 
unique spherical code with its parameters, up to orthogonal transformations, of 
course. Sec Appendix 1X1 for details and references. 

Even when the spherical code is not unique, one can still conclude from the strict 
absolute monotonicity of a that C must be sharp. The key observation is that 
is strictly conductive (except in the trivial case when C consists of two antipodal 
points) , which follows easily from Lemma l5.3l It implies that h is a strictly positive- 
definite polynomial of degree 2m— 1. Then the sharpness in Proposition l4.1l implies 
that C' must be a spherical deg(/i)-design (by Theorem 5.5 in (DGSj ) and hence a 
sharp configuration. Furthermore, each ofti, . . . ,tm must occur as an inner product 
in C, from switching the roles of C and C . 

7. The 600-cell 

The final configuration is the vertices of the regular 600-cell. The construction 
in Section does not work in this case, and overall the 600-cell appears to be 
intrinsically more complicated than the sharp configurations. The fundamental 
problem is that it is only a spherical 11-design, but the polynomial h constructed 
as in the previous section would have degree 15 (or 14 if one uses the alternate 
construction for antipodal configurations). Recall that being a spherical deg(/i)- 
design was crucial to having a sharp bound. In fact, h does turn out to be a 
positive-definite function in this case, but it proves a suboptimal bound. 

Andreev and Elkies have shown in |An3[ IE1| that a degree 17 polynomial with 
coefficients in Q(-\/5) proves a sharp bound for the number of points in a spherical 
code with minimal angle tt/5 = cos~^(l -I- \/5)/4 on S^, although polynomials of 
lower degree do not work. A similar phenomenon occurs in our problem. What 
saves the proof is that although the 600-cell is not a spherical 12-design, all spheri- 
cal harmonics of degrees from 13 to 19 do indeed sum to over the 600-cell. Degree 
12 is the only problem, and that can be addressed by ensuring that the 12-th ultra- 
spherical coefficient of h vanishes. The fact that spherical harmonics of degrees 13 
through 19 sum to over the 600-cell is easily checked using the distance distribu- 
tion from Table 12 which tells for each vertex how many others have a given inner 
product with it. One can also prove it using invariant theory as follows. The sym- 
metry group of the 600-cell is the H4 refiection group, for which the ring of invariant 
polynomials has generators of degrees 2 (the trivial invariant x 1— > |a:p), 12, 20, and 
30. Summing a polynomial over the vertices of the 600-cell amounts to averaging it 
with respect to H4. For any degree that is not a nonnegative integer combination 
of 12, 20, and 30, averaging must yield a polynomial that is constant on S^, and 
that constant vanishes for spherical harmonics because they are orthogonal to the 
constant polynomials. 

For the 600-ceU, we have m = 8 and {h, . . . ,t„} = {-1,0, ±1/2, (±1± V5)/4}; 
we order the inner products so that ti < ■ ■ ■ < ts- Let h{t) be the unique polynomial 



UNIVERSALLY OPTIMAL DISTRIBUTION OF POINTS ON SPHERES 



127 



Table 2. The distance distribution of the 600-cell. 

Inner Product Count 

±1 1 

(±l±^/5)/4 12 

±1/2 20 

30 



of degree at most 17 such that h{ti) — a{ti) for 1 < i < 8, h'{ti) = a'{ti) for 
2 < j < 8, and an = ai2 = ai3 = 0, where denotes the i-th ultraspherical 
coefficient of h. Note that we do not require that h'{—l) = a'(— 1). 

It might seem more natural to use a polynomial of degree 15 and simply set 
ai2 = 0, but that does not work. We have no conceptual explanation for why it 
fails or why our slightly more complicated approach succeeds. 

If h{t) < a{t) for all t and h is positive definite, then it proves a sharp bound 
(as pointed out above, because = 0). Unfortunately, we know of no simple 
reason why these inequalities hold. We have proved them using a computer algebra 
system. The proof is completely rigorous, in the sense that it uses no floating point 
arithmetic or other unrigorous approximations; all computations are performed 
using exact arithmetic in 



7.1. Proof that h is positive definite. We wish to prove that h{t) is a nonnega- 
tive linear combination of the ultraspherical polynomials Cf (t) . Each ultraspherical 
coefRcient of h(t) is a linear function of a{ti) and a'(ti) for I < i < m. Let ut denote 
the coefRcient of a{ti) in this linear function, and let Vi denote the coefficient of 

As explained in Section ^ it suffices to consider the potential functions /(r) = 
(4 — r)*^ with k G {0, 1,2,...}; up to scaling, that amounts to taking a{t) = (l+t)''. 



The linear combination 



{uia{U) + Via'{ti)) 



then becomes 



i=l 

If Vg > 0, then this expression is positive for all sufficiently large k. To prove that 
it is nonnegative for all fc, we simply compute a bound £ such that it is guaranteed 
to be positive for all k > £ and then check nonnegativity for < k < £. 
Let 



if a; < 0, and 
if X > 0. 



If £ is chosen so that 

m 
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then 



for all k > £. In other words, once the asymptotically dominant term outweighs all 
the negative terms, it continues to do so forever (because whenever k increases, the 
dominant term increases by a larger factor than any other term does). 

To prove that h is positive definite, we deal with eighteen cases, one for each 
ultraspherical coefficient of h. In each case, we compute the coefficients 

ui, . . . ,us,vi, . . . ,vs 

exphcitly as elements of (Q)(\/5). We then check that vs > and that l|7.1|l holds 
with i = 32. Finally, we check that the ultraspherical coefficient is nonnegative for 
k € {0, 1, . . . ,31}. These calculations are too cumbersome to carry out by hand, 
but they are easily done using a computer algebra system. 

7.2. Proof that h{t) < a{t). Unfortunately, the inequality h{t) < a{t) no longer 
has as simple a proof as in the case of the sharp configurations. To prove it, we will 
use Proposition Let 



F{t) = {t+l)ll{t + t,f 



i=2 

and let h be the usual Hermite interpolation H{a, F) of a (without requiring any 
ultraspherical coefficients to vanish). By Proposition 12. 21 

a{t) - h{t) 

m 

is absolutely monotonic on [—1, 1). Let q{t) be the quadratic Taylor polynomial for 
this quotient about t = —\. Then it follows from absolute monotonicity that 

a{t) - h{t) 



m 

for all t e [-1,1). Thus, 



><l{t) 



ajt) - h{t) ^ , h{t)-h{t) 
Fit) Fit) ■ 

The right side of this inequality is a quadratic polynomial. If we verify that it 
is nonnegative over [—1,1), then a(i) > hit) over that range (because Fit) > 
0). To check this, we simply check that it is nonnegative a.t t = ±1 and has 
nonpositive leading coefficient. Each of these calculations amounts to checking that 
a certain explicit linear combination of a(ti), . . . ,0(^8), a'(ii), ■ ■ ■ iCi'its), a^'^\—l), 
and a'^^^(— 1) is nonnegative. We prove it using the method from the previous 
subsection, with £ — 36. 

7.3. Proof of uniqueness. Finally, we must deal with the issue of uniqueness 
when the potential function / is strictly completely monotonic (so a is strictly 
absolutely monotonic). In that case, by Proposition l2.2l 

ait) - hit) 
Fit) 
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is strictly absolutely monotonic on (—1, 1), which implies that 

Fit) ^ ^ 

for t e (—1, 1). Thus, a{t) — h{t) has roots only &i t = ti for some i. Uniqueness 
follows, because the vertices of the regular 600-cell are the only (4, 120, (1 + \/5) /4) 
spherical code (see IB6I p. 260] or |BDp . 



8. Other compact two-point homogeneous spaces 

The key property underlying our proofs is the two-point homogeneity of the 
sphere. Recall that a metric space is two-point homogeneous if for each r > 0, 
its isometry group acts transitively on ordered pairs of points at distance r. In 
this section we generalize Theorem 11.21 to all compact, connected two-point ho- 
mogeneous spaces. This generalization is parallel to the general setting for linear 
programming bounds in jCSI ILev2 2^ev3^ . It seems plausible that our results also 
generalize to the case of discrete two-point homogeneous spaces (see Chapter 9 of 
|CS| for examples), but we have not investigated that possibility. 

Let X be a compact, connected two-point homogeneous space with more than 
one point. Wang proved in jWaj that, up to changing to an equivalent metric (i.e., 
applying an invertible function to the metric), X is one of the following possibili- 
ties: a sphere, a real projective space, a complex projective space, a quaternionic 
projective space, or the octonionic projective plane, each with its standard metric. 
In other words, we use the round metric on the sphere, the Fubini-Study metric on 
real, complex, or quaternionic projective space, and the .Fl^-invariant metric on OP^ 
(up to scaling, they are the unique two-point homogeneous Riemannian metrics on 
these spaces). 

The octonionic projective plane OP^ may be the least familiar of these possi- 
bihties; it cannot be defined as O'^ \ {0} modulo the action of O* because of the 
nonassociativity of the octonions (O* is not a group and does not act on O'^). How- 
ever, it can be defined using the exceptional Jordan algebra or as F4/Spin(9). There 
is no such space as OP'', because OP^ does not satisfy the Desargues theorem. See 
|Ba| for more details on octonionic projective geometry. 

Normalize the metric d on X (the geodesic distance with respect to the Rie- 
mannian metric) so that the greatest distance between points is tt. Just as we use 
inner products to measure distances between points on spheres, it is natural to use 
cos d{x, y) instead of d{x, y). In MP", CP", or HP" this function is easy to calculate: 
if we represent x and y by unit vectors in C""*"^, or EI"+^, respectively, then 

cosd{x,y) = 2|(x,y)p — 1, where the brackets denote the standard Hermitian form 
over K, C, or H. 

Spheres are n-point homogeneous for every n, but the other two-point homo- 
geneous spaces are not even three-point homogeneous. One consequence is that 
configurations in projective space can frequently be deformed without changing the 
distances between points. For example, for t g C with |r| = {l+\/5)/2, consider the 
configuration Cr consisting of the six points in CP^ with homogeneous coordinates 
[t,1,0], [r,-l,0], [1,0, r], [-1,0, r], [0,t,1], and [0,r,-l]. When r = (1 + V5)/2, 
they are the images in MP^ C CP^ of the vertices of a regular icosahedron in IR'^. 
The cosine of the distance between any two distinct points is —3/5, so the points 
form a regular simplex embedded in CP^ . It will follow from Theorem 18.21 that 
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each Ct is a universally optimal six-point code in CP^. Lemma WT\ shows that these 
configurations form a one-dimensional family. By contrast, positive-dimensional 
families of sharp configurations do not exist in spheres. 

Lemma 8.1. For |r| = |r'| = (1 -|- ^5)72, there is an isometry ofCP^ mapping 
to Ct' if and only if t/t' or t/t' is a sixth root of unity. 

Proof. The isometry group of CP^ is generated by the action of U{i) on lines in 
C'^ together with complex conjugation. Complex conjugation sends Cr to Cf, the 
diagonal unitary transformation that multiplies the j-th coordinate by e^'^^^l'^ sends 
Cr to Cre-2,ri/3, and the identity map sends Ct to C-t. In the other direction, if 
a;, y, and z are unit vectors in the six lines in C'^ corresponding to the points of Cr, 
then 

(x, y){y, z){z, x)e{l, 1/5, ±1/(5V5) , ±rV(|rp + l)', ±fV(|rp + l)'} , 

and each of these values occurs (note that the triple product {x,y){y,z){z,x) is 
independent of which unit vector is chosen in each fine). Because |tP/(|tP -t- l)'^ = 
l/(5\/5), the isometry class of Cr determines the set {±l,±T^,±f^}, and that 
proves the necessity of the condition in the lemma statement. □ 

The potential functions we will consider are the absolutely monotonic functions 
of cosd{x, y). As in the case of the sphere, these functions can also be understood 
in terms of another metric, namely the chordal distance dc{x,y) = sin (d(a;, y)/2) 
introduced for Grassmannians in (i.e., .yi-\{x,y)\^ when x and y are repre- 

sented by unit vectors as above). For motivation, recall that the Euclidean distance 
between two points separated by an angle of 9 on the unit sphere is 2sin(0/2). For 
projective spaces, the chordal distance is most naturally expressed in terms of the 
orthogonal projection operators Tlx and Hy onto the lines x and y. Then 

dcix,y) = \E^_EylL^ 
where | • |f is the Frobenius norm, defined by 

Absolutely monotonic functions of cos d{x, y) are the same as completely monotonic 
functions of squared chordal distance, just as in the case of the unit sphere. 
Let G be the isometry group of X. Then one can decompose L'^{X) as 

f>0 

where Vo,Vi,... are certain finite-dimensional irreducible representations of G. 
This decomposition is unique because the sunimands are pairwise nonisomorphic. 
For each £, one can define a positive-definite kernel Kg on X as in the case of the 
sphere, and one can define a function : [— 1, 1] ^ M such that 

Ke{x,y) = Ci{cosd{x,y)). 

We index the representations Ve so that Cg is a polynomial of degree £. One can 
compute these polynomials explicitly (see for example |Ganl p. 178]). For projective 
spaces, the result is that Ci is a positive constant times the Jacobi polynomial 
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Pj"''^\ where a = n/2 - 1, n is the dimension of X as a real manifold, and (3 is 

— 1/2, 0, 1, or 3 according to whether one is working over M, C, H, or O, respectively 
(i.e., j3 = (dimR ^)/2— 1 for the algebra A). We define a positive-definite polynomial 
to be a nonnegative linear combination of Co, Ci, . . . . 

An M -design in X is a finite subset C d X such that for f E Vi with 1 < i < M, 

x<£C 

As in the case of spheres, being an M-design is also characterized by the requirement 
that 

E Gi{cosd{x,y)) = 

for 1 < i < AI. For example, to verify that the configurations Cr constructed above 
are 1-designs in CP^, one need only check that 6Ci(l) + 30Ci(— 3/5) = 0. 

We say that a finite subset C G X is a sharp configuration if it is an M-design, 
TO distances occur between distinct points in C, and M > 2m —1 — 6, where 
S — 1 there are two antipodal points in C and S ~ otherwise. By a pair of 
"antipodal points" we mean points separated by the greatest possible distance in 
X; this condition is more subtle in projective spaces than in spheres, because it is 
no longer true that each point has a unique antipode. 

This definition of a sharp configuration agrees with the previous one in the case of 
spheres. Here we merely require an antipodal sharp configuration to be a (2to — 2)- 
design, but in the case of spheres every antipodal (2to — 2)-design is automatically 
a (2m — l)-design: by Theorem 7.4 of |D(tS| . every point in such a design has 
an antipode if one does, and that suffices to make every odd-degree homogeneous 
polynomial sum to zero. On the other hand, some antipodal sharp configurations 
in projective space are only (2to — 2)-designs. For example, the D4, Eq, and Ej 
root systems give antipodal sharp configurations of 12, 36, and 63 points in RP'^, 
MP^, and RP^, respectively. For these codes to = 2 (the cosines of the distances are 

— 1 and —1/2), and they are 2-designs but not 3-designs. By contrast, Es yields a 
3-design in MP^. 

The following theorem generalizes Theorem 11.21 and the results of Levenshtein 
from |Lev2| (namely, that all sharp configurations are optimal codes): 

Theorem 8.2. Let f : [—1,1) ^ M 6e absolutely monotonic, and let C C X be a 
sharp configuration. If C <Z X is any subset satisfying \C'\ = \C\, then 

E f{cosd{x,y)) > f{cosd{x,y)). 

x,yGC' ^x^y x.y^C.x^y 

If f is strictly absolutely monotonic, then equality implies that C is a sharp config- 
uration and that the same distances occur between points in C and C . 

Naturally, we call 

f {cos d{x,y)) 

x.y^C.x^y 

the /-potential energy of C. Strictly speaking, this definition conflicts with our ear- 
lier definition for the special case X = 5""^, because previously we used functions 
of squared Euclidean distance as our potential functions. However, that should not 
cause any confusion: the Euclidean distance is twice the chordal distance, so this 



132 



HENRY COHN AND ABHINAV KUMAR 



change simply amounts to the difference between using the squared chordal dis- 
tance and the cosine of the geodesic distance. In the terminology of Sectional here 
we are dealing with the absolutely monotonic function a instead of the completely 
monotonic potential function, and thus our previous framework is equivalent to the 
current one in the case of S^~^ . 

The proof of Theorem 18.21 is parallel to the proof for spheres, so we merely 
sketch it here, with particular attention to arguments that require modification in 
projective space. 

The linear programming bounds generalize in a straightforward manner, with 
essentially the same proof: 

Proposition 8.3. Let f : [—1,1) M. be any function. Suppose h: [—1,1] K 
is a polynomial such that h{t) < f(t) for all t G [—1,1), and suppose there are 
nonnegative coefficients ao, . . . , ad such that h has the expansion 

d 

= ^a,a(i). 

1=0 

Then every set of N points in X has f -potential energy at least 

N^ao - Nh{l). 

We apply this proposition in the same way as before. Suppose first that C is a 
{2m — l)-design, with m distances between distinct points in C. Let ti < ■ ■ ■ < t^^ 
be the cosines of those distances. As before, we construct the Hermite interpolating 
polynomial h of degree 2m — 1 that agrees with / to order 2 at each of ii, . . . , tm- 
The proof that h{t) < f{t) for all t G [—1, 1) is the same as in Lemma WT\ 

Define 

m 

F{t)^l[{t-t,). 

i=l 

The same proof as in Lemma [6.21 shows that F is positive definite, and we apply 
an analogue of Lemma 16.31 to express g in terms of a related family of orthogonal 
polynomials as pm + apm-i- To complete the proof that h is positive definite, we 
use the fact that the polynomials in that family are themselves positive definite, 
together with Theorem 13.11 to establish conductivity. Everything is completely 
parallel to the spherical case. Finally, because C is a {2m — l)-design and deg{h) = 
2m — 1, the lower bound for potential energy must be sharp. When / is strictly 
absolutely monotonic, the uniqueness argument is the same as before. 

When ti = —1 and C is a (2m — 2)-design but not a {2m — l)-design, one must 
instead use first-order Hermite interpolation at t = —1, which complicates the 
proof. We can assume that m > 1, since if m = 1, then every pair of points in C is 
antipodal and Theorem 18 . 21 visiblv holds. 

If we define 

m 

F{t)^l[{t-t.), 

1=2 

then the heart of the proof is the verification that (i-f l)F(i)^ is strictly conductive, 
and everything else works exactly the same as before. Let go , gi , . . . be the monic 
orthogonal polynomials with respect to (1 — i^) d^{t), where ^ is the measure on 
[— 1, 1] with respect to which the polynomials Ci are orthogonal. (Recall that they 
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are certain Jacobi polynomials P^°'''^\ Then dn{t) = (1 - t)"{l + dt.) The 
argument used to prove Lemma l6 .31 shows that F = Qm-i + o'Zm-2 for some a. 

The most subtle part of the proof of Theorem 18. 21 is the proof that a > 0. (If C 
were a (2m — l)-design, then a would vanish.) We start with a crucial observation 
about the polynomials qf. 

Lemma 8.4. For each i, the polynomial {t + l)qi{t) is strictly positive definite. 

Proof. This polynomial is orthogonal to all polynomials of degree less than i with 
respect to (1 — t)dfi(t), so for some constant c it equals pi+i(t) + cpi{t) in terms 
of the monic orthogonal polynomials for this measure. Setting t — ~1 shows that 
c > 0, because each pj has all its roots in (—1, 1) and hence Pj{—1) has sign (—1)^. 
As before, it follows from Lemma 13.31 and Proposition 13.21 that each pj is strictly 
positive definite. □ 

Lemma 8.5. Let f{t) he a monic polynomial, with 0-th coefficient fo in terms of 
Co(t),Ci(0,.... 

(1) //deg(/) < 2m - 2 and f{t,) = forl<i<m, then fo\C\ = /(I). 

(2) Ifdeglf ) = 2m - 1 and f it-) ^ for 1 < i < m, then fo\C\ < /(I). 

(3) Ifdeglf) < 2m - 2 and fit) >0 for -1 <t<l, then fo\C\ > /(I). 

Proof Write 

dog(/) 



fit) = V f,Q{t). 



i=0 



In cases (1) and (2), 
while in case (3), 
On the other hand. 



J2 /(cosd(x,y)) = |C|/(l), 



x,y£C 



^ /(cosd(x,y)) > |C|/(1). 

x,y£C 



dcg(/) 

^ f {cos d{x,y)) = ^ fi Ci{cosd{x,y)). 

The i = term yields /o|Cp, while all terms with 1 < i < 2m — 2 vanish because 
C is a (2m — 2)-design. In case (1) that yields /o|C| = /(I) and in case (3) it 
yields /o|C| > /(I). Finally, in case (2) we have /2m-i > because / is monic, 
so the i — 2m — 1 term is positive (because C is not a (2m — l)-design) and hence 
fo\C\ < /(I). □ 

Lemma 8.6. If f is a polynomial with deg(/) < i, then {t + l)f{t)qi{t) has 0-th 
coefficient equal to /(I) times that of (t + l)qi(t) (in terms of Co{t), Ci{t) , . . . ) . 

Proof. The 0-th coefficient of a polynomial p{t) equals 

Ip{t)dfiit) 
fdtiit) 

Divide f{t) by t - 1 to yield f{t) ^ g{t){t - 1) + /(I), with deg(.g) < i. Then 
f{t + l)f{t)q,{t) dll(l^ = I 9{t)q^{^^ (1 - t')dAi(t) + /(I) /(I + t)q,{t) dll(t). 
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Because qi is orthogonal to all polynomials of degree less than i with respect to 

{i-e)dii{t), 



{t + l)/(t)g,(t) d^Ji{t) = /(I) y (1 + t)q,{t) dfl{t), 

which completes the proof. □ 

Let Pi denote the 0-th coefficient of (t + l)qi{t) in terms of Co{t),Ci{t), . . . . 
Taking f(t) ^ {t + l)F(t) in case (1) of Lemma lO we find that 

+ /3„_2a)|C| = 2g„,_i(l) + 2g„_2(l)a. 

For an upper bound on \C\, take f{t) — (t + l)F{t)qm-i{t) in case (2). By 
Lemma 18.61 

(because F(l)/3„i_i > 0). For a lower bound, take f{t) = {t + l)g,„_2(i)^ in case 
(3). By Lemma ESI 

Id > /(I) ^ 2g„_2(l) 

'?m-2(l)/3m-2 /3m-2 

(because gm-2(l)/3m-2 > 0). We have shown that 

2(7,„_i(l)-/3„.i|C| > 

and 

2q^_2{l) - f3,n-2\C\ < 0, 

while 

2g™-l(l) - f3ra-l\C\ + a(2q,„_2(l) - /3™-2|C|) = 0. 

It follows that a > 0. 

To prove that {t + l)F{t)'^ is strictly conductive using Lemma [5. 31 we need only 
show that all its partial products 



i=2 

and 

3 
i=2 

are strictly positive definite. That follows from Theorem 13.11 Lemma 18.41 and 
the fact that the set of nonnegative linear combinations of go, 9i, 92, • ■ ■ is closed 
under taking products (see (Gas for the most general case or Theorem 5.2 in |Asl 
p. 44] for a simpler proof that applies to these orthogonal polynomials). Once the 
conductivity of (i-l-l)F(i)^ has been established, the rest of the proof of Theorem] 
is essentially identical to that of Theorem 11.21 
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9. The Euclidean case 

One can generalize ProDOsition l4. ll to apply to configurations in Euclidean space. 
Here we outline this approach, which leads to an optimization problem reminiscent 
of the Beurling-Selberg extremal functions (see 0). We deal only with the case 
of periodic, discrete point configurations (unions of finitely many translates of a 
lattice), to avoid worries about how potential energy is defined in pathological 
cases. 

Define the density of a periodic arrangement to be the number of points per unit 
volume in space. If the arrangement consists of the N translates K + vi, . . . ,K + vm 
of a lattice A g R", with Vj - Vk ^ A for j ^ k, then the density is N/ vol(R"/A). 
Given a potential function /: (0,oo) [0,oo), define the potential energy of the 
packing to be 

l<j,k<N x£A,x+Vj-Vk^O 

Of course summing over all pairs of points in the packing would give a divergent sum 
(even this sum may diverge, in which case we say the potential energy is infinite). 
The potential energy is the average over all points in the configuration of the sum 
over all distances to other points. In particular, the same periodic arrangement 
can be written in terms of different A and vi, . . . ,vn, but the potential energy is 
independent of this choice. 

For a different point of view, imagine placing finitely many points vi, . . . , vjsf on 
the flat torus R"/A. The energy of a point is given by a sum over all geodesies 
connecting it to the other points or to itself, and the energy of the configuration is 
the average of this quantity over all the points. 

Let Bji{0) denote the ball of radius R about 0. Given a configuration C C R", 

let 

CR = CnBR{0). 

Lemma 9.1. Let C be a periodic point configuration in R". // the potential energy 
of C is finite, then it equals 

The same holds even without the assumption that f is nonnegative, as long as 
f{x) = 0{{1 + |2;|)-"/2-'5) for some S > 0. 

Proof. Suppose C is the disjoint union of A + vi, . . . , A + vn as above. Say a point 
in C has type j if it is in A + vj . With each point x e C we associate the sum 

(9.1) J2 fii^-vH- 

veC,v^x 

This sum depends only on the type of x, and the definition of the /-potential energy 
of C equals the average over all types of this quantity. 

All these sums must converge, because of the bounds on / if / is not assumed 
nonnegative or else because of the hypothesis that the potential energy is finite. 
Let e > be small, and choose K such that for x of each type, 

E \f{\^-y\')\<e- 

y€C,\y-x\>K 
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In particular, summing in (|9.1|l only over y such that |a; — ?/| < K yields a partial 
sum within e of the total sum. 
In the sum 

we can focus our attention on the points x € Bji-K{0), since the number in Br{Q)\ 
Br-k{0) is negligible compared to the number in Bn-KiO) (on the order of i?"^^ 
rather than i?" as R oo). Each point contributes a bounded amount to the sum, 
so omitting the points in Bji{0) \ i3/?_A'(0) changes the sum by 0(i?"~-^). 
For each x G Cr-k, 

is within e of 

E f{\-~v\')- 

The numbers of points of each type in Cr-k are equal, up to a factor of 1 + 0(l/i?) . 
Thus, for R large, 

x.y^CR.xi^y 

is 0(l/i?) plus a quantity within e of the /-potential energy of C. Choosing e 
arbitrarily small and R accordingly large completes the proof. □ 

The Euclidean analogue of Proposition 14.11 is stated in terms of the Fourier 
transform on R". We normalize the Fourier transform of an function h : M" M 

by 

h{t) = [ /i(2:)e2'^*<^'*> dx. 

Lemma 9.2. Let C be a periodic point configuration in M", with density 5. If e > 
is sufficiently small ( depending on C ), then 

|2 



lim / ' , '-dt^5. 



e 



2TTi{x^t) I 



Proof. We will prove that for e > sufficiently small, if g is any radial, smooth 
function supported in B^(fS), then 

f g27ri(x,t)|2 

hm / 1^-^"" ^-g{t)dt = 5g{Q). 

Because the characteristic function of a small ball can be bounded above and below 
by such functions, that will suffice to prove the lemma. 
We expand the numerator of the integrand via 

2 



E 

xeCn 



^2-Ki{x.t) 



E - 



^2-ni{x~y,t) 

x,yeCR 



It follows that 
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Let C be the disjoint union of translates A + wi, . . . , A + wat of a lattice A. It follows 
from Lemma l9. II that 

' x,yeCR l<j.k<N zeA 

(This quantity is not quite the same as that considered in Lemma [9. II because we 
allow X = y OT z + Vj — Vk — in the sum, but the extra terms amount to ^(0) on 
each side.) 

Recall the Poisson summation formula, which states that for every Schwartz 
function h : R" — > R and every v e M" , 

V h(z + v)^ ,J V /J(i)e"2"<^'*) , 

^ ^ ^ vol R"/A ^ ^ ' 

where 

A* = {< e R" : {t, z) for all z G A} 

is the dual lattice. (See Appendix A of |CK2| for the standard proof.) Taking h = g 
yields 

2 



1 E Eg(" + ^^--"^)= iVvol(R"/A) ^'^'^ 

l<j,k<N z&A ^ ' ' teA* 



E - 

1<J<A' 



If the support of g is sufficiently small, then only the i = term contributes to the 
right side, and it equals Sg[^) because TV/ vol(R"/A) = 8. □ 

Proposition 9.3. Let /: (0, oo) [0, oo) he any function. Suppose h: R" R 
satisfies h{x) < f{\x\'^) for all a; G R" \ {0} and is the Fourier transform of a 
function g G ^^(R") such that g{t) > for all t G R". Then every periodic 
configuration in M" with density 6 has f -potential energy at least 

6{\imjnig{t))-h{0). 

Without loss of generality, one can assume that g and h are radial functions 
(replace g with the average of its rotations about the origin). For simplicity of 
notation, we typically write g = h. That is justified because g is in fact the Fourier 
transform of /i as a tempered distribution (when these functions are radial; of course 
in general g{t) = h{—t)), but note that we have not assumed that h is integrable 
and thus it is not clear that 

converges. By contrast, h{x) does equal 

J ft(t)e2"<^'*> dt. 

Proof. Define 

£ = liminfo(t). 

Let £ > 0, and choose r/ > such that g{t) > £ — e whenever \t\ < ij. 
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Let C be a periodic configuratioii with density S. By Lemma 1(01 the /-potential 
energy is 

x,yeCR,x^y 

Because h{x) < /(|a;p) for all x ^ 0, 

x,yGCR,x^y x,y€Cft 

On the other hand, 

|\p 27i-i<£!;,t) |2 

^ 27ri(a;,t)|2 
> / 

In the limit as i? — > oo. Lemma 19.21 implies that the potential energy is at least 

-/i(0) + (S(^-e), 

if T] is sufficiently small. Because e can be taken to be arbitrarily small, the desired 
lower bound follows. □ 

This proposition is closely analogous to the linear programming bounds for 
sphere packing introduced in |CE| (see also |Col ICKIL ICK2| ). The proof given 
here is somewhat different, because we have arranged it to minimize the hypothe- 
ses on h. If one is willing to assume stronger hypotheses, namely bounds on the 
decay rates of h and /i, then one can give a proof closer to that in 'CE' by moving 
the Poisson summation from the proof of Lemma [9. 21 into that of Proposition |^| 

Analogously to the case of Proposition l4.1l one can derive the bounds from |CE| 
(with weaker hypotheses than in that paper) by setting 



oo for r < 1, and 
otherwise. 



Then one concludes that the sphere centers in a packing with balls of radius 1/2 
must have density at most /i(0)//i(0) (assuming h is continuous at 0). 

Let A2, Ag, and A24 denote the hexagonal lattice in M^, the £'§ root lattice in 
R®, and the Leech lattice in R^'', respectively. 

Conjecture 9.4. Let n e {2,8,24}, and let /: (0, 00) — > R 6e completely mono- 
tonic and satisfy f{x) — 0(|a:|^"/^^^) as \x\ 00 for some e > 0. Then there 
exists a function h that satisfies the hypotheses of Proposition Iff. 31 and proves that 
A„ has the least f -potential energy of any periodic configuration in R" with its 
density. 

One consequence of this conjecture is that these lattices are universally optimal 
configurations in Euclidean space. That appears to be a rare property, which 
provably fails for lattices in dimensions 3, 5, 6, and 7. The unique optimal lattices 
in those dimensions are the A3, D5, E^, and Ej root lattices (see | Mar| ] . Each 



UNIVERSALLY OPTIMAL DISTRIBUTION OF POINTS ON SPHERES 139 

of these lattices has higher energy than its dual lattice for the potential function 
f{x) — e~^, when both lattices are rescaled to have unit density. 

Montgomery proved in |Mo| that the hexagonal lattice is universally optimal 
among all lattices in R^, but that is a weaker assertion than universal optimality 
among all periodic point configurations. Sarnak and Strombergsson conjectured in 
|SS| that D4, Eg,, and the Leech lattice are universally optimal among lattices and 
proved that they are local optima. Perhaps one could combine their theorem with 
the techniques in this paper and |CK1| to prove that Eg and A24 are the global 
minima among lattices in their dimensions. 

If true, Conjecture 19.41 implies that £"§ and the Leech lattice are the densest 
sphere packings in their dimensions, by using the potential function /(r) = l/r" 
with s large. (The density of a sphere packing is the fraction of space covered by the 
spheres; unfortunately this terminology conflicts with the density of a point configu- 
ration. We will refer to these numbers as the "packing density" and "point density" 
to avoid confusion.) The potential energy of any periodic packing is asymptotic to 

K 

^min 

as s — + CXI, where rmin is the smallest distance between distinct points and K is 
the average number of points at distance rmin from a point in the packing (i.e., 
the average kissing number if one centers balls of radius rmin/2 at the points of 
the arrangement to form a sphere packing). Universal optimality implies that r^ain 
must be as large as possible for a periodic configuration of point density 1, which 
means the packing density is maximized. Thus, Conjecture 19.41 implies that no 
periodic packing can exceed the packing density of Eg, or the Leech lattice, and 
therefore no packing can (see Appendix A of [DEI)- 

Furthermore, Coniecture l9.4l imDlies that £^8 and the Leech lattice are the unique 
densest periodic packings in and M^'*. In the notation used above, any equally 
dense periodic packing must have at least as large an average kissing number K, 
or the potential energy would be lower. Because Eg and A24 have the greatest 
possible kissing numbers in their dimensions and those kissing configurations are 
unique (see Chapters 13 and 14 of |CSj ). the kissing configurations in the hypo- 
thetical packing are determined up to orthogonal transformations. However, those 
orthogonal transformations may be nontrivial, so it is not obvious that the packing 
is uniquely determined. 

Lemma 9.5. For n ^ 8 or n = 24, there is a unique sphere packing in R" such 
that each kissing configuration is the A„ kissing configuration. 

Proof. Without loss of generality, suppose one sphere is centered at a point in A„ 
and the spheres tangent to it are centered at the nearest points in A„. We must 
prove that each sphere is centered at a point of A„. Call a sphere center good if 
it is in A„ and the same is true for all spheres tangent to it, and call one point 
a neighbor of another if they are the centers of tangent spheres. By assumption, 
at least one point is good; call it the origin. We will prove that all neighbors of 
good points are also good. It follows that all points in the packing are good. To 
see why, suppose x is not good with |a;| minimal. Then x must have a neighbor y 
with \y\ < since otherwise a suitable scalar multiple of x (chosen between x and 
the origin to match the minimal distance of the packing) could be included with 
the neighbors of x to increase the kissing number in R"; the reason is that every 
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neighbor of x would be on or outside the sphere of radius |x| and hence at least 
as far from the new point as x is. The smaller neighbor y is good by assumption, 
which implies that x is good. 

It remains to prove that all neighbors of good points are good. Suppose x is good 
and y is a neighbor of x. We must show that there is a unique way to arrange the 
neighbors of y (namely, as points of A„). The positions of the common neighbors 
of X and y are already determined, because x is good. It would sufhce to show 
that every isometry of R" that fixes x and y and preserves the set of common 
neighbors must preserve A„, because any two possible kissing arrangements about 
y are related by such an isometry. (They are related by some isometry fixing y 
by the uniqueness of the kissing configuration. Because the symmetry group acts 
transitively on the neighbors (see Lemma 1.10 in |Ebl p. 25] and Theorem 27 in 
|CSI p. 288]), we may assume that x is fixed as well, and it follows that the set 
of common neighbors is also preserved.) For n = 8 we will show that this is true; 
for n = 24 it is false, but we will prove a slightly weaker statement that will still 
suffice. 

Without loss of generality, we can translate the origin so that y = 0. When n ~ 8, 
X and its 56 common neighbors with span E'g, so every orthogonal transformation 
that preserves x and the common neighbors clearly preserves E^- That completes 
the proof by showing that the kissing arrangements about two neighbors can only 
be in one orientation relative to each other. 

When n — 24, x and its 4600 common neighbors with do not span A24 (their 
span cannot contain any vector that has an odd inner product with x). Instead, 
they span a sublattice L of index 2, as is not hard to check using Figure 4.12 in 
|CSI p. 133] (with X ~ (4, —4, 0, . . . , 0) /a/8)- This lattice contains all the minimal 
vectors from A24 that have even inner product with x, and it contains twice the 
minimal vectors from A24 that have inner product ±1 with x. 

The dual lattice 

L* ^{we M" : {w, z) eZ for all z £ L} 

contains L (because L is integral) , and considering covolumes shows that [L* : L] = 
4. The Leech lattice is contained in L* , as is the vector x/2, and they must generate 
L* because they generate a lattice in which L has index 4. 

Every orthogonal transformation that preserves x and the set of 4600 common 
neighbors preserves L and hence L* . That means the kissing configuration of our 
packing at must be contained among the vectors of norm 4 in L* , because there 
exists such an orthogonal transformation that yields the Leech kissing configuration, 
which is contained in L* . Besides the minimal vectors of A24, the only other 
possibilities are the vectors of the form ±(x/2 — z) where z is a minimal vector 
of A24 satisfying = 1. (Any vector in L* but not A24 must be of the form 

x/2 — w with w E A24. A short calculation shows that if \x/2 — w\'^ = 4, then either 
w = z with jzp = 4 and {x, z) = 1 01 w = x — z for such a z.) 

The kissing configuration must contain all the minimal vectors of L, namely the 
minimal vectors of A24 that have even inner product with x, because L is preserved 
under the transformation to the Leech kissing configuration. It must then either 
contain zLz for all z G A24 with |zp = 4 and (x, z) = 1, in which case it is simply all 
the minimal vectors in A24, or it must contain all the vectors ±{x/2 — z) for such z. 
(If it contains one of the points ztz, then it contains all of them because any one of 
them together with L spans A24 and hence the orthogonal transformation would be 
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a symmetry of A24.) One can check as follows that the second possibility is indeed 
a valid kissing configuration about 0. If r denotes reflection in the hyperplane 
through the origin orthogonal to x, then the map z 1— s- —r(z) preserves L, maps x 
to X, and interchanges the two kissing configurations. 

We have shown that there are only two distinct kissing configurations about 
that both contain x and the 4600 common neighbors. Fortunately, only the first can 
occur in the sphere packing. Suppose the kissing configuration about contained 
x/2 — z with z e A24, — 4, and {x,z) = 1. Because x is good, its kissing 
configuration contains the point x — z, which is so close to x/2 — z that the spheres 
centered at those points would overlap. That contradiction rules out the second 
case, so must be good as well, which completes the proof. □ 

Under Coniecture l9.4l let hs prove a sharp bound for f{r) = l/r''. It seems likely 
that, when suitably rescaled, hg converges as s — > 00 to a function that proves a 
sharp bound in the setting of |CE| . (One can check that this holds for the functions 
constructed below when n = 1.) That would put the mysterious functions from that 
paper into a broader context and exhibit them as limits of families of functions. 

We have not been able to prove Conjecture 19.41 but we can prove the one- 
dimensional analogue. Of course, minimizing potential energy in is completely 
trivial, so this result has no interesting geometric consequences. However, it is not 
obvious that Proposition 19 . 31 proves a sharp bound even in this trivial case, so we 
view its sharpness as an argument in favor of Conjecture 19.41 

Proposition 9.6. Let /: (0, 00) — s- M 6e completely monotonic and satisfy f{x) = 
0{\x\~^^'^^^^ for some e > as \x\ —> 00. Then there exists a function ft,: M — > M 
such that h[x) < f{x^) for allx S R\{0}, h is the Fourier transform of a continuous 
function h: M. ^M. with support in [—1, 1], h{t) > for all t gM., and 

fcGZ\{0} 

For notational convenience, define f{x) — f[x^) for a; G K\ {0} (and let f'{x) — 
2xf'{x'^) denote its derivative). Then we can take 

fcez\{o} ^ ^ ^ ^ feez\{o} ^ ^ 



Sm TTX 



TTX 



2 CO 



k=l 



(Of course we define h{x) by continuity when x G Z.) By the mean value theorem 
and the fact that /' is increasing and nonpositive, 



fix) - f{x/2) 



x/2 

which implies that \xf'{x)\ — 0{jx\^^^^^^y, thus, the sum of /c/'(/i;) in the definition 
of h converges, because f'{k) — 2fc/'(fc^). 

We have no reason to think that these functions are the only ones that prove a 
sharp bound in Proposition l9.3l for n = 1, but they are the only ones we know how to 
construct. We prove below that they satisfy the properties listed in Proposition l9.6l 
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The construction of h is based on a variant of the Shannon sampUng theorem 
for band-hmited functions (see Theorem 9 in V for the particular interpolation 
formula we use). It is natural to search for a function h such that h vanishes 
outside [—1,1], by analogy with the one-dimensional construction in Section 5 of 
|CE| . One can reconstruct such a function from its values and derivatives at the 
integers, assuming h is integrable, via 

fcez ^ ^ ^ fcez ^ ' 

We will not require this general fact, but it is important motivation for the definition 
ofh. 

It is easy to check from the definition of h that h{k) — f{k) and h' {k) = f'{k) 
for k G Z \ {0}. These conditions are necessary if h is to prove a sharp bound in 
Proposition 19. 31 We also have h'{0) — (in fact, h is even). However, the value of 
h{0), which is given by 



h{0) = -2j2{f{k) + k.nk)), 



k=l 



is more mysterious. Setting h{0) — /(O) would make no sense, because /(O) = /(O), 
which is frequently undefined. The value we have chosen for h{0) makes h have 
the right properties, but we know no motivation for it that does not rely on some 
calculation. 



Because 

' sin TTx 



1 

(1- |i|)e2^**^di 

-1 



and 



(sin7rx)2 _ sgni .^^^^ 



_i 2'Ki 



h is the Fourier transform of a function h supported on [—1, 1]. Because h is even, h 
is also even, so it suffices to compute h{t) for t £ [0, 1]. (Of course the simplification 
is inessential, but it avoids absolute value signs.) For such t, one finds that 



h{t) = 2(l-t)^/(/c)cos27rfct- i^/'(fc)sin27rfct 
-2(l-i)^(/(fc) + fc/'(fc)). 

k=l 

One motivation for the value of h{Q) is that if one formally computes h'{l), one 
gets the answer 0. However, we do not require this fact and make no attempt to 
justify it. 

To complete the proof of Proposition 19. 61 we must prove that h{x) < f{x^) for 
all X and h{t) > for all t. One potential approach is to reduce to the case of 
f{x) — e~'^^ with c > 0: by Bernstein's theorem (Theorem 12b in [Widl p. 161]), 
there exists a weakly increasing function a : [0, oo) — > R such that for all a; > 0, 

/■oo 

fix) = / e-^-da(c). 
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We can thus assume without loss of generahty that f{x) — e^"^^ with c > 0. (There 
is no contribution from c ~ because if there were, then / would not decay at 
infinity.) To see that the inequalities we need are preserved under this reduction, 
we must check that integration over c commutes with the sums over k in the formulas 
for h and h. That follows from Fubini's theorem. 

We have not fully carried out this approach, but it is likely possible. For the less 
general case of f{x) = e""^^, which corresponds to the special case when not only 
/ but also X I— > fi^"^) is completely monotonic, the inequality h{x) < f{x'^^ is very 
nearly proved in Theorem 8 of [(iiVj . which constructs similar functions to solve a 
related optimization problem. That paper does not contain precisely the inequality 
we need, but a simple modification of the techniques used there suffices, and one 
can furthermore prove that h(t) > by computing h explicitly and manipulating 
it. This proof suffices for potential functions / such that x i— > f{x^) is completely 
monotonic, but it is somewhat complicated and unilluminating. 

Here we check these inequalities (and hence prove Proposition 19. 6() via a more 
conceptual approach, by proving that they follow from the corresponding assertions 
about codes on by taking limits. Consider N points on S^, under the completely 
monotonic potential function g; we will define g in terms of / shortly. In the proof 
of Theorem 11.21 we constructed an auxiliary function h^^g that proved a sharp 
bound for the g-potential energy in Proposition^^ When N is odd, one can write 
it down explicitly as follows. Define F{t) — (T7v(t) — l)/(i — 1), where T/y is the 
A^-th Chebyshev polynomial, given by Tn{cos6) — cos N6. (The roots of F are the 
inner products other than 1 in the regular A^-gon.) 

Lemma 9.7. When N is odd, 

r:F{r)=0 ^ ' 

where 

ttr = 2(l-r)2(l + r)3(2-2r) 

and 

br = -4(1 - r)2(l + r)g'{2 ~ 2r) - 2(1 - r)(l + 2r)g{2 - 2r). 

The sum is over all roots r of F, each included only once despite the fact that 
they are double roots. The proof of this formula amounts to verifying that the right 
side is the Hermite interpolation of 1 1— > 17(2 — 2t) to second order at the roots of F, 
which is a straightforward calculation. (The case of even N differs slightly because 
one must treat the single root r = — 1 of F differently. We will assume without loss 
of generality that TV is odd, but the even case can easily be handled by the same 
techniques.) 

We define g by 

5(x)-/(iVV(4^'))- 
The motivation is that as — > 00, 

g{2 ~2cos(2ttx/N)) = J{{N/ii) sin(7ra;/iV)) -> /(x). 



which suggests the following lemma. 
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Lemma 9.8. For g{x) = /{N'^x/ {Att'^)) , we have 



^im h]y^g(^cos{2TTx/N)) — h{x) 



as N ^ oo with N odd. 



The same is also true for N even, but we will not need it. Note that Lemma 1^751 
makes no assertion as to the uniformity of the convergence (pointwise convergence 
will suffice for our purposes). 

Proof. At a formal level, it follows from taking limits term by term and using 

^.^ g'(2-2cos(2W^)) _ fjx) 
N-^co {N/{2n)y 2a; 

and 

i^(cos(27ra;/7V)) / simrx^ 
lim — ' 



that hN^g{ cos{2TTx/N)) converges to 

sm'Kx\'^ ^ / J., . /2kx - 2kx + k'^\ ~,/7n 

which is equal to h{x). (The roots of F are of the form cos(27rfc/iV), which is how 
the sum over r becomes a sum over k.) To prove rigorously that 

lim hN^g(^cos{2nx/N)) = h{x), 

one must justify taking the limit term by term. 

To do so, we will apply the dominated convergence theorem. Set 

r = cos(27rfc/Af) = 1 - 2 sin2(7rfc/iV) 

for < fc < N/2, and turn the sum over r in the formula for hj^^g{t) into a sum 
over all fc > by including zero terms for all larger fc. Let t ~ 1 — 2sm^{'Kx/N). 
We will prove that the fc-th term in the sum is 0(fc^^^^^) in absolute value, where 
the implicit constant is independent of N. (Of course this calculation uses f{x) — 
0(1x1-1/2-^) and f{x) =0(|a:|-3/2-^).) Because 

OO 

^fc--- 

fc=i 

converges, taking limits term by term is justified by the dominated convergence 
theorem. 

One can calculate that the fc-th term is given by 

+ brjt -r) _ 2{2sl - 1 + 3{s,/sk)' ^ 4^) 4jV^4(l - 4) . 

where to save space we write 

Sx — svli{ttx/N), 
Sk = sin(7rfc/iV), 
fk = f{iN/7rfsm^7rk/N)), 

and 

fL^f'iiN/nfsm^TTk/N)). 
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We always have < k < N/2, and for fixed x we have < a; < N/2 whenever 
N is sufficiently large relative to x. It follows that Sk and Sx are always within 
fixed constant factors of irk/N and ttx/N, respectively. If k is also sufficiently large 
relative to x, then examining H9.2() and applying iV^sf = 0(/c^) shows that the right 
side is 0(fc~^^^^) in absolute value, where the implicit constant is independent of 
N. □ 

The desired inequalities for h and h now follow easily. For each N, 
hN,g{ cos{2TTx/N)) < g{2 - 2 cos(27ra;/iV)) , 
so it follows in the limit as iV ^ oo that 

h{x) < fix), 

as desired. For the other inequality, we apply Bochner's theorem, which says that a 
continuous function /i on M is the Fourier transform of a nonnegative, function 
(which is of course uniquely determined by h, up to a set of measure 0) if and 
only if h is positive definite, in the sense that for all xi,. . . ,xe G M, the £ x i 
matrix whose i,j entry is h{xi — Xj) is positive semidefinite. (See [SI P- 124] or 
[RI p. 303].) That property follows immediately from the corresponding property 
of hN,g, because the set of positive-semidefinite matrices is closed. This completes 
the proof of Proposition 19. 61 
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Appendix A. Uniqueness of spherical codes 

Uniqueness of the A^-gon is trivial. For the simplex, cross polytope, icosahe- 
dron, and 600-cell, uniqueness follows from analyzing the case of equality in the 
Boroczky bound (see )Bol p. 260], or see )BDj for another proof for the 600-cell). For 
the (24,196560,1/2), (23,4600,1/3), (8,240,1/2), and (7,56,1/3) spherical codes, 
uniqueness was proved by Bannai and Sloane in |BS| (see also Chapter 14 of jCSj ). 
For the (6,27,1/4), (5,16,1/5), (22,275,1/6), (21,162,1/7), and (22,100,1/11) 
codes, uniqueness follows from results in the theory of strongly regular graphs. In 
each case there are only two distinct inner products other than 1, so one can form 
a graph out of the vertices by making edges correspond to one particular inner 
product (one obtains one of two complementary graphs, which encode the same in- 
formation). Using Theorem 7.4 and Lemma 7.3 from (DCS) one proves that these 
graphs are strongly regular and determines their parameters. In each case there 
are unique strongly regular graphs with these parameters. For the (6, 27, 1/4) and 
(5,16,1/5) codes, see Theorem 10.6.4 and Lemma 10.9.4 in |GR.| : the remaining 
cases are dealt with in jCGSj . It follows that the Gram matrices of the corre- 
sponding point configurations are uniquely determined, so the configurations are 
determined up to orthogonal transformations. 

For the (23, 552, 1/5) code, uniqueness follows from the uniqueness of the regular 
two-graph on 276 vertices. In any (23,552,1/5) code, the linear programming 
bounds show that only the inner products —1 and ±1/5 occur. Such a code thus 
gives rise to an arrangement of 276 equiangular lines in M.^^, which corresponds 
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to a regular two-graph on 276 vertices. (See Chapter 11 of |GR| for background 
on two-graphs and equiangular line arrangements.) Goethals and Seidel proved in 
[US] that it is unique, which implies the uniqueness of the (23, 552, 1/5) code. 

The last remaining case is the (22, 891, 1 /4) spherical code. A proof of its unique- 
ness is implicit in jCu) (based on results from |SY| ] . For an alternative proof using 
the techniques of Bannai and Sloane, see |('K3| . 
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